Abstract

In the last decade, there has been a growing interest in using Wishart processes for
modelling, especially for financial applications. However, there are still few studies on 
the estimation of its parameters. Here, we study the Maximum Likelihood Estimator 
(MLE) in order to estimate the drift parameters of a Wishart process. We obtain precise 
convergence rates and limits for this estimator in the ergodic case and in some nonergodic 
cases. We check that the MLE achieves the optimal convergence rate in each case. 
Motivated by this study, we also present new results on the Laplace transform that 
extend the recent findings of Gnoatto and Grasselli [17] and are of independent interest. 

Keywords: Wishart processes, Laplace transform, parameter inference, maximum likelihood, limit theorems, local asymptotic properties.

1 Introduction and preliminary results

The goal of this paper is to study the maximum likelihood estimation of the parameters of 
Wishart processes. These processes were introduced by Bru [7] and take values in the
set of positive semidefinite matrices. Let $d \in \mathbb{N}^*$ denote the dimension, $\mathcal{M}_d$ be the set of
real $d$-square matrices, $\mathcal{S}_d^+$ (resp. $\mathcal{S}_d^{+,*}$) be the subset of positive semidefinite (resp. definite)
matrices, $\mathcal{S}_d$ (resp. $\mathcal{A}_d$) the subset of symmetric (resp. antisymmetric) matrices. Wishart
processes are defined by the following SDE

\[
\begin{cases}
dX_t = [\alpha a^\top a + bX_t + X_t b^\top]\,dt + \sqrt{X_t}\,dW_t\,a + a^\top dW_t^\top \sqrt{X_t}, & t > 0, \\
X_0 = x \in \mathcal{S}_d^+,
\end{cases}
\tag{1}
\]

where $\alpha \ge d-1$, $a \in \mathcal{M}_d$, $b \in \mathcal{M}_d$ and $(W_t)_{t \ge 0}$ denotes a $d$-square matrix made of independent
Brownian motions. We recall that for $x \in \mathcal{S}_d^+$, $\sqrt{x}$ is the unique matrix in $\mathcal{S}_d^+$ such that
$(\sqrt{x})^2 = x$. It is shown by Bru [7] and Cuchiero et al. [8] in a more general affine setting that
the SDE (1) has a unique strong solution when $\alpha \ge d+1$ and a unique weak solution when
$\alpha \ge d-1$. Besides, we have $X_t \in \mathcal{S}_d^{+,*}$ for any $t \ge 0$ when $x \in \mathcal{S}_d^{+,*}$ and $\alpha \ge d+1$. In this
paper, we will denote by $WIS_d(x,\alpha,b,a)$ the law of $(X_t, t \ge 0)$ and $WIS_d(x,\alpha,b,a;t)$ the
law of $X_t$. In dimension $d = 1$, Wishart processes are known as Cox-Ingersoll-Ross processes
in the literature. It is worth recalling that the law of $X$ only depends on $a$ through $a^\top a$ since
we have

\[
WIS_d(x,\alpha,b,a) \overset{\text{law}}{=} WIS_d(x,\alpha,b,\sqrt{a^\top a}),
\]

see e.g. equation (12) in [1]. Therefore, the parameters to estimate are $\alpha$, $b$ and $a^\top a$.
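For readers who want to experiment with the SDE (1), the following is a minimal simulation sketch, not the authors' scheme. It relies on the classical construction going back to Bru [7]: when $\alpha = n$ is an integer and $a = I_d$, $X_t = Y_t^\top Y_t$ solves (1) if $Y$ is an $n \times d$ matrix Ornstein-Uhlenbeck process $dY_t = Y_t b^\top dt + dB_t$. The function name `simulate_wishart_ou`, the Euler step for $Y$, and the restriction to integers $n \ge d$ are assumptions made for this illustration.

```python
import numpy as np


def simulate_wishart_ou(x, n, b, T, steps, rng):
    """Simulate X_t = Y_t^T Y_t on a regular grid of [0, T].

    Y is an n x d matrix Ornstein-Uhlenbeck process dY = Y b^T dt + dB,
    so that X has the Wishart dynamics (1) with alpha = n and a = I_d.
    Hypothetical helper: only integer n >= d is handled, and Y is
    advanced with a plain Euler step.
    """
    x = np.asarray(x, dtype=float)
    b = np.asarray(b, dtype=float)
    d = x.shape[0]
    if n < d:
        raise ValueError("this construction is written for an integer n >= d")
    h = T / steps
    # Pick Y_0 with Y_0^T Y_0 = x: symmetric square root of x padded with zero rows.
    w, u = np.linalg.eigh(x)
    sqrt_x = (u * np.sqrt(np.clip(w, 0.0, None))) @ u.T
    y = np.vstack([sqrt_x, np.zeros((n - d, d))])
    path = np.empty((steps + 1, d, d))
    path[0] = y.T @ y
    for k in range(steps):
        y = y + (y @ b.T) * h + np.sqrt(h) * rng.standard_normal((n, d))
        path[k + 1] = y.T @ y
    return path
```

Because every sample is of the form $Y^\top Y$, the simulated matrices stay symmetric positive semidefinite by construction, which a direct Euler scheme on $X$ would not guarantee.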

Wishart processes were originally considered by Bru [6] to model some biological
data. Recently, they have been widely used in financial models in order to describe the
evolution of the dependence between assets. Namely, Gourieroux and Sufana [19] and Da
Fonseca et al. [10] have proposed a stochastic volatility model for a basket of assets that 
assumes that the instantaneous covariance between the assets follows a Wishart process. 
This extends the well-known Heston model [21] to many assets. Wishart processes have also 
been used for interest rates models. Affine term structure models involving these processes 
have been proposed for example by Gourieroux and Sufana [20], Gnoatto [16] and Ahdida 
et al. [2]. For these models, the question of estimating the parameters of the underlying 
Wishart process may be important for practical purposes and should be possible thanks to 
the profusion of financial data. This issue has been considered by Da Fonseca et al. [9] for the
model presented in [10]. However, there is no dedicated study on the Maximum Likelihood
Estimator (MLE) for Wishart processes. For the Cox-Ingersoll-Ross process, the estimation
of parameters has been studied earlier, motivated in particular by its use for interest rates
(see Fournié and Talay [14]). Later on, the MLE was studied by Overbeck [31], including
some nonergodic cases, and more recently by Ben Alaya and Kebaier [4, 5]. This paper
completes the literature by studying the MLE for Wishart processes.

In this paper, we will follow the theory developed in the books by Liptser and Shiryaev [27]
and Kutoyants [23] and assume that we observe the full path $(X_t, t \in [0,T])$ up to time $T > 0$.
This choice will be convenient from a mathematical point of view to study the convergence 
of the MLE. Of course, in practice it can be relevant to study precisely the estimation when 
we only observe the process on a discrete time-grid. This is left for further research, but 
we already observe in our numerical experiments that the discrete approximation of the 
MLE gives a satisfactory estimation of Wishart parameters (see Section 6). It is worth 
noticing that once we observe the path $(X_t, t \in [0,T])$, the parameter $a^\top a$ is known. In
fact, we can calculate the quadratic covariation (see for example Lemma 2 in [1]) and get for
$i,j,k,l \in \{1,\dots,d\}$

\[
\langle X_{i,j}, X_{k,l} \rangle_T = \int_0^T \left[ (a^\top a)_{j,l}(X_s)_{i,k} + (a^\top a)_{j,k}(X_s)_{i,l} + (a^\top a)_{i,l}(X_s)_{j,k} + (a^\top a)_{i,k}(X_s)_{j,l} \right] ds.
\tag{2}
\]

This leads to

\[
(a^\top a)_{i,i} = \frac{\langle X_{i,i} \rangle_T}{4\int_0^T (X_s)_{i,i}\,ds},
\qquad
(a^\top a)_{i,j} = \frac{\frac12 \langle X_{i,i}, X_{i,j} \rangle_T - (a^\top a)_{i,i}\int_0^T (X_s)_{i,j}\,ds}{\int_0^T (X_s)_{i,i}\,ds},
\tag{3}
\]

for $1 \le i,j \le d$ and $j \ne i$. We note that these quantities are well defined as soon as the path
$(X_t, t \in [0,T])$ has a finite quadratic variation and is such that $X_t \in \mathcal{S}_d^{+,*}$ $dt$-a.e., which is
satisfied by the paths of Wishart processes (see Proposition 4 in [7]). We will assume that
$a^\top a \in \mathcal{S}_d^{+,*}$ and denote by $a \in \mathcal{M}_d$ an invertible matrix that matches the observed value of
$a^\top a$: $a$ can be for example the square root of $a^\top a$ or the Cholesky decomposition of $a^\top a$.
Then, we know that $Y_t = (a^\top)^{-1} X_t a^{-1}$ follows the law $WIS_d((a^\top)^{-1} x a^{-1}, \alpha, (a^\top)^{-1} b a^\top, I_d)$,
see e.g. equation (13) in [1]. It is therefore sufficient to focus on the estimation of the
parameters $\alpha$ and $b$ when $a = I_d$, which we consider now.
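On a discrete time grid the quadratic covariation can only be approximated, so the identification of $a^\top a$ becomes an estimate. The sketch below illustrates this for the diagonal entries only, using the relation $d\langle X_{i,i}\rangle_t = 4 (a^\top a)_{i,i} (X_t)_{i,i}\,dt$ (the case $i=j=k=l$ of the quadratic covariation formula). The function name `estimate_ata_diagonal`, the realized-variance approximation and the left-point Riemann sum are assumptions of this illustration, not a procedure taken from the paper.

```python
import numpy as np


def estimate_ata_diagonal(path, h):
    """Estimate the diagonal of a^T a from a path sampled with time step h.

    Hypothetical helper based on d<X_ii>_t = 4 (a^T a)_ii (X_t)_ii dt:
    the realized quadratic variation of X_ii is divided by four times a
    left-point Riemann sum of X_ii.
    """
    path = np.asarray(path, dtype=float)
    diag = np.einsum("tii->ti", path)            # (steps + 1, d) diagonal entries
    realized_qv = np.sum(np.diff(diag, axis=0) ** 2, axis=0)
    integral = h * diag[:-1].sum(axis=0)         # approximates int_0^T (X_s)_ii ds
    return realized_qv / (4.0 * integral)
```

The off-diagonal entries could be handled in the same spirit from the covariation of $X_{i,i}$ and $X_{i,j}$; they are left out here to keep the sketch short.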

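We first present the MLE of $\theta = (b,\alpha)$, and we denote by $\mathbb{P}_\theta$ the original probability
measure under which $X$ satisfies

\[
dX_t = [\alpha I_d + bX_t + X_t b^\top]\,dt + \sqrt{X_t}\,dW_t + dW_t^\top \sqrt{X_t}.
\tag{4}
\]

When no confusion is possible, we also denote this probability by $\mathbb{P}$. We consider $\alpha_0 \ge d+1$
and set $\theta_0 = (0, \alpha_0)$. We will assume for the joint estimation of $\alpha$ and $b$ that

\[
\alpha \ge d+1 \quad \text{and} \quad x \in \mathcal{S}_d^{+,*}.
\tag{5}
\]

The latter assumption is not restrictive in practice since the condition $\alpha \ge d+1$ ensures that
$X_t \in \mathcal{S}_d^{+,*}$ for any $t > 0$. Due to this assumption, we know by Theorem 4.1 in Mayerhofer [29]
that

\[
\frac{d\mathbb{P}_{\theta_0,T}}{d\mathbb{P}_{\theta,T}} := \exp\left( \int_0^T \mathrm{Tr}[H_s\,dW_s] - \frac12 \int_0^T \mathrm{Tr}[H_s H_s^\top]\,ds \right),
\quad \text{with } H_t = \frac{\alpha_0 - \alpha}{2}\left(\sqrt{X_t}\right)^{-1} - b\sqrt{X_t},
\]

defines a probability measure under which $\tilde{W}_t = W_t - \int_0^t H_s^\top ds$ is a $d \times d$-Brownian motion,
where $\mathbb{P}_{\theta,T}$ is the restriction of $\mathbb{P}_\theta$ to the $\sigma$-algebra $\sigma(W_s, s \in [0,T])$. We have

\[
dX_t = \alpha_0 I_d\,dt + \sqrt{X_t}\,d\tilde{W}_t + d\tilde{W}_t^\top \sqrt{X_t},
\]

and the likelihood is then defined by (see Liptser and Shiryaev [27], Chapter 7)

\[
L_T^{\theta,\theta_0} = \frac{1}{\mathbb{E}_\theta\left[ \exp\left( \int_0^T \mathrm{Tr}[H_s\,dW_s] - \frac12 \int_0^T \mathrm{Tr}[H_s H_s^\top]\,ds \right) \Big| \mathcal{F}_T^X \right]},
\tag{6}
\]

where $(\mathcal{F}_t^X)_{t \ge 0}$ denotes the filtration generated by the process $X$.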

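Proposition 1.1. For $X \in \mathcal{S}_d^{+,*}$, let $\mathcal{L}_X : \mathcal{S}_d \to \mathcal{S}_d$ be the linear map defined by
$\mathcal{L}_X(Y) = XY + YX$. It is invertible, and the likelihood of $(X_t, t \in [0,T])$ is given by

\[
\begin{aligned}
L_T^{\theta,\theta_0} = \exp\Bigg( & \frac{\alpha-\alpha_0}{4} \log\left(\frac{\det[X_T]}{\det[x]}\right) - \frac{\alpha-\alpha_0}{4}\left(\frac{\alpha+\alpha_0}{2} - 1 - d\right) \int_0^T \mathrm{Tr}[X_t^{-1}]\,dt - \frac{\alpha T}{2}\mathrm{Tr}[b] \\
& + \frac12 \int_0^T \mathrm{Tr}\left[\mathcal{L}_{X_t}^{-1}(bX_t + X_t b^\top)\,dX_t\right] - \frac14 \int_0^T \mathrm{Tr}\left[\mathcal{L}_{X_t}^{-1}(bX_t + X_t b^\top)(bX_t + X_t b^\top)\right] dt \Bigg).
\end{aligned}
\tag{7}
\]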
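The operator $\mathcal{L}_X(Y) = XY + YX$ and its inverse appear in every formula that follows, so a small numerical sketch may help. It inverts $\mathcal{L}_X$ in an eigenbasis of $X$: if $X = U\,\mathrm{diag}(w)\,U^\top$, then $(U^\top Y U)_{ij} = (U^\top C U)_{ij}/(w_i + w_j)$ solves $XY + YX = C$. The names `lyap` and `lyap_inv` are hypothetical; this is one possible implementation, assuming $X$ is symmetric positive definite.

```python
import numpy as np


def lyap(x, y):
    """Apply L_X(Y) = X Y + Y X."""
    return x @ y + y @ x


def lyap_inv(x, c):
    """Solve X Y + Y X = c for Y, assuming x is symmetric positive definite.

    In an orthonormal eigenbasis of x the equation decouples entrywise:
    (w_i + w_j) * Y_ij = C_ij, where w are the eigenvalues of x.
    """
    w, u = np.linalg.eigh(x)
    c_rot = u.T @ c @ u
    return u @ (c_rot / (w[:, None] + w[None, :])) @ u.T
```

Two identities that can be checked with this sketch are $\mathcal{L}_X^{-1}(I_d) = X^{-1}/2$ and $\mathrm{Tr}[\mathcal{L}_X^{-1}(bX + Xb^\top)] = \mathrm{Tr}[b]$ for any $b \in \mathcal{M}_d$.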

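Lemmas B.1 and B.2 state some properties of $\mathcal{L}_X$, and the proof of Proposition 1.1 is given
in Appendix A. In particular, we see from this proof that the density $\frac{d\mathbb{P}_{\theta,T}}{d\mathbb{P}_{\theta_0,T}}$ coincides with $L_T^{\theta,\theta_0}$ only if $b \in \mathcal{S}_d$,
in which case the likelihood has the following simpler form

\[
L_T^{\theta,\theta_0} = \exp\left( \frac{\alpha-\alpha_0}{4} \log\left(\frac{\det[X_T]}{\det[x]}\right) - \frac{\alpha-\alpha_0}{4}\left(\frac{\alpha+\alpha_0}{2} - 1 - d\right) \int_0^T \mathrm{Tr}[X_s^{-1}]\,ds + \frac{\mathrm{Tr}[bX_T] - \mathrm{Tr}[bx]}{2} - \frac12 \int_0^T \mathrm{Tr}[b^2 X_s]\,ds - \frac{\alpha T}{2}\mathrm{Tr}[b] \right),
\tag{8}
\]

since $\mathcal{L}_{X_t}^{-1}(bX_t + X_t b) = b$.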

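Now, we want to maximize the likelihood and observe that the quantity in the exponential (7)
is quadratic with respect to $(b,\alpha)$ and goes almost surely to $-\infty$ when $\|(b,\alpha)\| \to +\infty$.
To see this, we first remark that $\mathrm{Tr}[b] = \mathrm{Tr}[\mathcal{L}_{X_t}^{-1}(bX_t + X_t b^\top)]$ by Lemma B.1. Then, the Cauchy-Schwarz
inequality yields

\[
\begin{aligned}
\left| \frac{\alpha T}{2} \mathrm{Tr}[b] \right|
&= \left| \frac{\alpha}{2} \int_0^T \mathrm{Tr}\left[ \mathcal{L}_{X_s}^{-1}(bX_s + X_s b^\top) \sqrt{X_s} \left(\sqrt{X_s}\right)^{-1} \right] ds \right| \\
&\le \sqrt{ \frac12 \int_0^T \mathrm{Tr}\left[ \mathcal{L}_{X_s}^{-1}(bX_s + X_s b^\top)(bX_s + X_s b^\top) \right] ds } \; \sqrt{ \frac{\alpha^2}{4} \int_0^T \mathrm{Tr}[X_s^{-1}]\,ds } \\
&\le \frac14 \int_0^T \mathrm{Tr}\left[ \mathcal{L}_{X_s}^{-1}(bX_s + X_s b^\top)(bX_s + X_s b^\top) \right] ds + \frac{\alpha^2}{8} \int_0^T \mathrm{Tr}[X_s^{-1}]\,ds,
\end{aligned}
\tag{9}
\]

and it is strict almost surely¹, which gives that the quadratic form in the exponential (7) is
negative definite. There is thus a unique global maximum of (7) on $\mathcal{M}_d \times \mathbb{R}$. We know from
Lemma B.2 that $\mathcal{L}_{X_t}^{-1}$ is self-adjoint, and we get with straightforward calculations that the
MLE $\hat\theta_T = (\hat b_T, \hat\alpha_T)$ is characterized by the following equations:

\[
\begin{cases}
\dfrac14 \log\left(\dfrac{\det[X_T]}{\det[x]}\right) - \dfrac{\hat\alpha_T - 1 - d}{4} \displaystyle\int_0^T \mathrm{Tr}[X_s^{-1}]\,ds - \dfrac{T}{2} \mathrm{Tr}[\hat b_T] = 0, \\[2ex]
\displaystyle\int_0^T \mathcal{L}_{X_s}^{-1}(dX_s) X_s - \int_0^T \mathcal{L}_{X_s}^{-1}(\hat b_T X_s + X_s \hat b_T^\top) X_s\,ds - \dfrac{\hat\alpha_T T}{2} I_d = 0.
\end{cases}
\tag{10}
\]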


Except in the ergodic case, we will not be able to obtain convergence results for this estimator.
Instead, we will mostly work with the MLE when $b$ is known to be symmetric. This
enables us to work with more tractable formulas, even if the calculations are already quite
involved in this case. Analyzing the general case would require the development of further arguments.
Besides, we can consider that Wishart processes with $b$ symmetric already form an interesting
family of processes that may be rich enough in many applications. When $b \in \mathcal{S}_d$, the unique
global maximum $\hat\theta_T = (\hat b_T, \hat\alpha_T)$ of (8) on $\mathcal{S}_d \times \mathbb{R}$ is characterized by the following equations:

\[
\begin{cases}
\dfrac14 \log\left(\dfrac{\det[X_T]}{\det[x]}\right) - \dfrac{\hat\alpha_T - 1 - d}{4} \displaystyle\int_0^T \mathrm{Tr}[X_s^{-1}]\,ds - \dfrac{T}{2} \mathrm{Tr}[\hat b_T] = 0, \\[2ex]
\dfrac{X_T - x}{2} - \dfrac12 \displaystyle\int_0^T (\hat b_T X_s + X_s \hat b_T)\,ds - \dfrac{\hat\alpha_T T}{2} I_d = 0.
\end{cases}
\tag{11}
\]


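To get more explicit formulas, we have to invert this linear system. For $X \in \mathcal{S}_d$ and $a \in \mathbb{R}$,
we define the linear maps

\[
\mathcal{L}_X : \mathcal{S}_d \to \mathcal{S}_d, \; Y \mapsto YX + XY
\quad \text{and} \quad
\mathcal{L}_{X,a} : \mathcal{S}_d \to \mathcal{S}_d, \; Y \mapsto YX + XY - 2a\,\mathrm{Tr}[Y] I_d.
\tag{12}
\]

We introduce the following shorthand notation

\[
R_T := \int_0^T X_s\,ds, \qquad Q_T := \left( \int_0^T \mathrm{Tr}[X_s^{-1}]\,ds \right)^{-1}, \qquad Z_T := \log\left( \frac{\det[X_T]}{\det[x]} \right),
\tag{13}
\]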
and note that $Q_T$ and $Z_T$ are defined only for $\alpha \ge d+1$ while $R_T$ is defined for $\alpha \ge d-1$
and belongs almost surely to $\mathcal{S}_d^{+,*}$. By using the convexity property of the inverse, see e.g.
Mond and Pecaric [30], we have when $\alpha \ge d+1$

\[
\mathrm{Tr}[R_T^{-1}] < \frac{Q_T^{-1}}{T^2} \quad \text{a.s.}
\tag{14}
\]

We get $\hat\alpha_T = 1 + d + Q_T(Z_T - 2T\,\mathrm{Tr}[\hat b_T])$ and $\mathcal{L}_{R_T, T^2 Q_T}(\hat b_T) = X_T - x - T[Q_T Z_T + 1 + d] I_d$.
By (14) and Lemma B.1, the latter equation can be inverted, which leads to

\[
\begin{cases}
\hat\alpha_T = 1 + d + Q_T\left( Z_T - 2T\,\mathrm{Tr}\left[ \mathcal{L}^{-1}_{R_T, T^2 Q_T}\left( X_T - x - T[Q_T Z_T + 1 + d] I_d \right) \right] \right), \\[1ex]
\hat b_T = \mathcal{L}^{-1}_{R_T, T^2 Q_T}\left( X_T - x - T[Q_T Z_T + 1 + d] I_d \right).
\end{cases}
\tag{15}
\]
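The closed form for the joint estimator can be sketched numerically as follows. This is an illustration under stated assumptions rather than the authors' implementation: the function names are hypothetical, the shifted operator $Y \mapsto YX + XY - 2a\,\mathrm{Tr}[Y]I_d$ is inverted through the trace identity $\mathrm{Tr}[Y](1 - a\,\mathrm{Tr}[X^{-1}]) = \mathrm{Tr}[X^{-1}C]/2$, and `path_statistics` replaces the time integrals by left-point Riemann sums on a grid with step `h`, which is only an approximation of the continuous-time quantities $R_T$ and $Q_T$.

```python
import numpy as np


def lyap_inv(x, c):
    """Solve Y X + X Y = c for symmetric positive definite x."""
    w, u = np.linalg.eigh(x)
    return u @ ((u.T @ c @ u) / (w[:, None] + w[None, :])) @ u.T


def lyap_shifted_inv(x, a, c):
    """Solve Y X + X Y - 2 a Tr[Y] I = c, assuming a * Tr[X^{-1}] != 1."""
    x_inv = np.linalg.inv(x)
    trace_y = np.trace(x_inv @ c) / (2.0 * (1.0 - a * np.trace(x_inv)))
    # Y = L_X^{-1}(c) + 2 a Tr[Y] L_X^{-1}(I), and L_X^{-1}(I) = X^{-1} / 2.
    return lyap_inv(x, c) + a * trace_y * x_inv


def path_statistics(path, h):
    """Grid approximations of R_T and Q_T (left-point sums) and Z_T."""
    path = np.asarray(path, dtype=float)
    r_T = h * path[:-1].sum(axis=0)
    q_T = 1.0 / (h * sum(np.trace(np.linalg.inv(m)) for m in path[:-1]))
    z_T = np.linalg.slogdet(path[-1])[1] - np.linalg.slogdet(path[0])[1]
    return r_T, q_T, z_T


def mle_symmetric(x0, x_T, r_T, q_T, z_T, T):
    """Joint estimate (b_hat, alpha_hat) for symmetric b, following the closed form above."""
    d = x0.shape[0]
    rhs = x_T - x0 - T * (q_T * z_T + 1 + d) * np.eye(d)
    b_hat = lyap_shifted_inv(r_T, T ** 2 * q_T, rhs)
    alpha_hat = 1 + d + q_T * (z_T - 2.0 * T * np.trace(b_hat))
    return b_hat, alpha_hat
```

A useful sanity check is that the returned pair satisfies the two first-order conditions exactly, whatever the (admissible) input statistics are.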

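The estimator of $\alpha$ given by the MLE is no longer well defined when $\alpha \in [d-1, d+1)$.
The same thing already occurs in dimension $d = 1$ for the CIR process, see Ben Alaya and
Kebaier [4]. However, it is still possible to estimate the parameter $b \in \mathcal{M}_d$ when $\alpha \ge d-1$
is known. In this case, we denote $\theta = (b, \alpha)$ and $\theta_0 = (0, \alpha)$ and get by repeating the same
arguments that

\[
L_T^{\theta,\theta_0} = \exp\left( \frac12 \int_0^T \mathrm{Tr}\left[ \mathcal{L}_{X_t}^{-1}(bX_t + X_t b^\top)\,dX_t \right] - \frac14 \int_0^T \mathrm{Tr}\left[ \mathcal{L}_{X_t}^{-1}(bX_t + X_t b^\top)(bX_t + X_t b^\top) \right] dt - \frac{\alpha T}{2} \mathrm{Tr}[b] \right),
\]

and the MLE is characterized by

\[
\int_0^T \mathcal{L}_{X_s}^{-1}(dX_s) X_s - \int_0^T \mathcal{L}_{X_s}^{-1}(\hat b_T X_s + X_s \hat b_T^\top) X_s\,ds - \frac{\alpha T}{2} I_d = 0.
\tag{16}
\]

When $b$ is known a priori to be symmetric, the likelihood and the MLE are then given by

\[
L_T^{\theta,\theta_0} = \exp\left( \frac{\mathrm{Tr}[bX_T] - \mathrm{Tr}[bx]}{2} - \frac12 \int_0^T \mathrm{Tr}[b^2 X_s]\,ds - \frac{\alpha T}{2} \mathrm{Tr}[b] \right),
\tag{17}
\]

\[
\hat b_T = \mathcal{L}_{R_T}^{-1}\left( X_T - x - \alpha T I_d \right).
\tag{18}
\]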
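When $\alpha$ is known and $b$ is symmetric, the estimator reduces to a single inversion of $\mathcal{L}_{R_T}$. The sketch below shows this computation; the function names are hypothetical, and in practice $R_T$ would have to be approximated from discrete observations (for instance by a trapezoidal sum), which is an assumption of the illustration and not part of the continuous-observation setting.

```python
import numpy as np


def lyap_inv(x, c):
    """Solve Y X + X Y = c for symmetric positive definite x."""
    w, u = np.linalg.eigh(x)
    return u @ ((u.T @ c @ u) / (w[:, None] + w[None, :])) @ u.T


def mle_b_known_alpha(x0, x_T, r_T, alpha, T):
    """Estimate symmetric b as L_{R_T}^{-1}(X_T - x - alpha T I_d).

    x0, x_T: initial and terminal values of the path,
    r_T: (an approximation of) the time integral of the path on [0, T].
    """
    d = x0.shape[0]
    return lyap_inv(r_T, x_T - x0 - alpha * T * np.eye(d))
```

If the inputs satisfy $X_T - x - \alpha T I_d = bR_T + R_T b$ exactly, the function returns $b$; on simulated or observed paths the difference is driven by the martingale part of $X$.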


The goal of the paper is to study the convergence of the MLE under the original probability
$\mathbb{P}_\theta$. To do so, we first consider the case where the Wishart process is ergodic. By
Lemma C.1, this holds if $-(b + b^\top) \in \mathcal{S}_d^{+,*}$ when $b \in \mathcal{M}_d$, and the ergodicity is equivalent
to $-b \in \mathcal{S}_d^{+,*}$ when $b \in \mathcal{S}_d$. Then, we can use Birkhoff's ergodic theorem to determine the
convergence of the MLE. Section 2 presents these results for (15) when $\alpha \ge d+1$, for (7) when
$\alpha > d+1$ and for both (18) and (16) when $\alpha \ge d-1$. Section 3 studies the convergence of
the MLE in some nonergodic cases, namely when $b = b_0 I_d$ with $b_0 \ge 0$ and when $b$ is known
to be symmetric. More precisely, when $b = 0$, we obtain convergence results for (15) when
$\alpha \ge d+1$ and for (18) when $\alpha \ge d-1$. When $b_0 > 0$, we only obtain convergence results
for (18) when $\alpha \ge d-1$. In all these cases, we analyse the convergence by means of Laplace
transforms. Though limited to some nonergodic cases, we recover and extend the
recent convergence results obtained by Ben Alaya and Kebaier for the one-dimensional CIR
process [4, 5]. In Section 4, we check that the MLE achieves the optimal rate of convergence
in the different cases by proving local asymptotic properties. Last, we study in Section 5 the
Laplace transform of $(X_T, R_T)$. This study can be of independent interest and improves the
recent results of Gnoatto and Grasselli [17].

¹ This is obvious when $\alpha > d-1$ since $X_t \in \mathcal{S}_d^{+,*}$ a.s. by Proposition 4 in [7]. For $\alpha = d-1$, we would have
by contradiction the existence of $v_T \in \mathcal{F}_T$ such that $\forall t \in [0,T]$, $v_T^\top X_t v_T = 0$. This is clearly not possible by
using the connection with matrix-valued Ornstein-Uhlenbeck processes in this case, see eq. (5.7) in [7].


2 Statistical Inference of the Wishart process: the ergodic case 

When $-(b + b^\top) \in \mathcal{S}_d^{+,*}$, the Wishart process $X_t$ converges in law when $t \to +\infty$ to the
stationary law $X_\infty \sim WIS_d(0, \alpha, 0, \sqrt{2 q_\infty}; 1/2)$ with $q_\infty = \int_0^\infty e^{sb} e^{sb^\top} ds$ for any starting
point $x \in \mathcal{S}_d^+$ by Lemma C.1. Therefore this is the unique stationary law, which is thus
extremal, and we know by Stroock ([35], Theorem 7.4.8) that it is then ergodic, see also
Pagès [32], Annex A. We introduce the following quantity

\[
R_\infty := \mathbb{E}_\theta(X_\infty).
\]

From Birkhoff's ergodic theorem, we have

\[
\frac{R_T}{T} \xrightarrow{a.s.} R_\infty, \quad \text{as } T \to +\infty.
\tag{19}
\]

Besides, when $\alpha \ge d+1$, $Q_\infty := \dfrac{1}{\mathbb{E}_\theta(\mathrm{Tr}[X_\infty^{-1}])}$ is finite and satisfies

\[
Q_\infty \mathrm{Tr}[R_\infty^{-1}] < 1,
\tag{20}
\]

due to the convexity property of the inverse, see e.g. Mond and Pecaric [30]. Again,
Birkhoff's ergodic theorem gives

\[
T Q_T \xrightarrow{a.s.} Q_\infty = \frac{1}{\mathbb{E}_\theta(\mathrm{Tr}[X_\infty^{-1}])}, \quad \text{as } T \to +\infty.
\tag{21}
\]

This section is organized as follows. First, we study the MLE (15) when $b$ is known to be
symmetric in the cases $\alpha > d+1$ and $\alpha = d+1$. Then, we focus on the MLE (10) when
$b \in \mathcal{M}_d$ and $\alpha > d+1$. The analysis follows the same steps and reuses some calculations
made in the symmetric case. Last, we study the convergence of the MLE when $\alpha \ge d-1$ is
known, in both symmetric and general cases.


2.1 The global MLE of $\theta = (b, \alpha)$ when $b$ is known to be symmetric

When $b \in \mathcal{S}_d$, the ergodicity is by Lemma C.1 equivalent to $-b \in \mathcal{S}_d^{+,*}$, which we assume
in this subsection. We have $X_\infty \sim WIS_d(0, \alpha, 0, \sqrt{-b^{-1}}; 1/2)$ and it is easy to get from (4)
that $\alpha I_d + b R_\infty + R_\infty b = 0$, which gives $R_\infty = -\frac{\alpha}{2} b^{-1} \in \mathcal{S}_d^{+,*}$. We will also show in the
proof of Theorem 2.1 that

\[
Q_\infty = \frac{\alpha - (1 + d)}{-2\,\mathrm{Tr}[b]}.
\tag{22}
\]
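As a quick numerical companion to this paragraph, the sketch below computes the two ergodic constants for a symmetric negative definite $b$. It takes $R_\infty = -\frac{\alpha}{2} b^{-1}$ from the stationarity equation $\alpha I_d + bR_\infty + R_\infty b = 0$, and it assumes that (22) reads $Q_\infty = (\alpha - (1+d))/(-2\,\mathrm{Tr}[b])$; with these values one gets $Q_\infty \mathrm{Tr}[R_\infty^{-1}] = (\alpha - 1 - d)/\alpha$, which is indeed smaller than 1 as required by (20). The function name is hypothetical.

```python
import numpy as np


def ergodic_constants(alpha, b):
    """Return (R_inf, Q_inf) for symmetric negative definite b.

    R_inf solves alpha I + b R + R b = 0, and Q_inf is taken to be
    (alpha - (1 + d)) / (-2 Tr[b]) (assumed reading of equation (22)).
    """
    b = np.asarray(b, dtype=float)
    d = b.shape[0]
    if not np.allclose(b, b.T) or np.linalg.eigvalsh(b).max() >= 0:
        raise ValueError("b must be symmetric negative definite")
    r_inf = -0.5 * alpha * np.linalg.inv(b)
    q_inf = (alpha - (1 + d)) / (-2.0 * np.trace(b))
    return r_inf, q_inf
```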
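We consider the convergence of the MLE given by (15) when $\alpha \ge d+1$. We introduce
the following martingales:

\[
M_t := \int_0^t \sqrt{X_s}\,dW_s + \int_0^t dW_s^\top \sqrt{X_s},
\tag{23}
\]

\[
N_t := \int_0^t \mathrm{Tr}\left[ \left(\sqrt{X_s}\right)^{-1} dW_s \right].
\tag{24}
\]

We use the dynamics of $(X_t)_{t \ge 0}$ under $\mathbb{P}_\theta$ and Itô's formula for $(Z_t)_{t \ge 0}$ (see e.g. Bru [7],
equation (2.6)) to get on the one hand

\[
X_T = x + \alpha T I_d + \mathcal{L}_{R_T}(b) + M_T, \qquad Z_T = (\alpha - 1 - d) Q_T^{-1} + 2\,\mathrm{Tr}[b] T + 2 N_T.
\tag{25}
\]

On the other hand, we obtain from (11) and (13) that $X_T = x + \hat\alpha_T T I_d + \mathcal{L}_{R_T}(\hat b_T)$ and
$Z_T = (\hat\alpha_T - 1 - d) Q_T^{-1} + 2T\,\mathrm{Tr}[\hat b_T]$, which yields

\[
\begin{cases}
\hat\alpha_T - \alpha = 2 T Q_T\,\mathrm{Tr}[b - \hat b_T] + 2 Q_T N_T, \\
\mathcal{L}_{R_T}(\hat b_T - b) = (\alpha - \hat\alpha_T) T I_d + M_T = 2 T^2 Q_T\,\mathrm{Tr}[\hat b_T - b] I_d + M_T - 2 T Q_T N_T I_d.
\end{cases}
\tag{26}
\]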


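Theorem 2.1. Assume that $-b \in \mathcal{S}_d^{+,*}$ and $\alpha > d+1$. Under $\mathbb{P}_\theta$, $\sqrt{T}(\hat b_T - b, \hat\alpha_T - \alpha)$
converges in law when $T \to +\infty$ to the centered Gaussian vector $(G, H)$ that takes values in
$\mathcal{S}_d \times \mathbb{R}$ and has the following Laplace transform: for $(c, \lambda) \in \mathcal{S}_d \times \mathbb{R}$,

\[
\mathbb{E}_\theta\left[ \exp\left( \mathrm{Tr}[cG] + \lambda H \right) \right] = \exp\left( \frac{2 Q_\infty \lambda^2}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} - \frac{2 Q_\infty \lambda\,\mathrm{Tr}[c R_\infty^{-1}]}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} + \mathrm{Tr}\left[ c\,\mathcal{L}^{-1}_{R_\infty, Q_\infty}(c) \right] \right).
\]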
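Since the limit is Gaussian, its Laplace transform is $\exp(\frac12 \mathrm{Var})$ of the linear form, so second moments can be read off directly. The sketch below does this under the assumption that the exponent in Theorem 2.1 is $\frac{2Q_\infty\lambda^2}{D} - \frac{2Q_\infty\lambda\,\mathrm{Tr}[cR_\infty^{-1}]}{D} + \mathrm{Tr}[c\,\mathcal{L}^{-1}_{R_\infty,Q_\infty}(c)]$ with $D = 1 - Q_\infty\mathrm{Tr}[R_\infty^{-1}]$; this gives $\mathrm{Var}(H) = 4Q_\infty/D$, $\mathrm{Cov}(\mathrm{Tr}[cG], H) = -2Q_\infty\mathrm{Tr}[cR_\infty^{-1}]/D$ and $\mathrm{Var}(\mathrm{Tr}[cG]) = 2\,\mathrm{Tr}[c\,\mathcal{L}^{-1}_{R_\infty,Q_\infty}(c)]$. All function names are hypothetical.

```python
import numpy as np


def lyap_inv(x, c):
    """Solve Y X + X Y = c for symmetric positive definite x."""
    w, u = np.linalg.eigh(x)
    return u @ ((u.T @ c @ u) / (w[:, None] + w[None, :])) @ u.T


def lyap_shifted_inv(x, a, c):
    """Solve Y X + X Y - 2 a Tr[Y] I = c, assuming a * Tr[X^{-1}] != 1."""
    x_inv = np.linalg.inv(x)
    trace_y = np.trace(x_inv @ c) / (2.0 * (1.0 - a * np.trace(x_inv)))
    return lyap_inv(x, c) + a * trace_y * x_inv


def limit_second_moments(c, r_inf, q_inf):
    """Return (Var(Tr[cG]), Cov(Tr[cG], H), Var(H)) for symmetric c.

    Read off from the Gaussian Laplace transform as described in the
    lead-in paragraph (assumed form of the exponent).
    """
    r_inv = np.linalg.inv(r_inf)
    denom = 1.0 - q_inf * np.trace(r_inv)
    var_h = 4.0 * q_inf / denom
    cov_ch = -2.0 * q_inf * np.trace(c @ r_inv) / denom
    var_c = 2.0 * np.trace(c @ lyap_shifted_inv(r_inf, q_inf, c))
    return var_c, cov_ch, var_h
```

For instance, with $d = 2$, $\alpha = 4$ and $b = -I_2$ (so $R_\infty = 2I_2$ and $Q_\infty = 1/4$ under the assumptions above), the asymptotic variance of $\sqrt{T}(\hat\alpha_T - \alpha)$ evaluates to $4/3$.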


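Proof. By (14) and Lemma B.1, we can rewrite the system (26) as follows

\[
\begin{cases}
\sqrt{T}(\hat\alpha_T - \alpha) = 2 T Q_T \left( \dfrac{N_T}{\sqrt{T}} - \mathrm{Tr}\left[ \mathcal{L}^{-1}_{\frac{R_T}{T}, T Q_T}\left( \dfrac{M_T}{\sqrt{T}} - 2 T Q_T \dfrac{N_T}{\sqrt{T}} I_d \right) \right] \right), \\[2ex]
\sqrt{T}(\hat b_T - b) = \mathcal{L}^{-1}_{\frac{R_T}{T}, T Q_T}\left( \dfrac{M_T}{\sqrt{T}} - 2 T Q_T \dfrac{N_T}{\sqrt{T}} I_d \right).
\end{cases}
\]

Note that, for $i,j,k,l \in \{1,\dots,d\}$ we have

\[
\langle M_{i,j}, M_{k,l} \rangle_t = \delta_{jl}(R_t)_{i,k} + \delta_{jk}(R_t)_{i,l} + \delta_{il}(R_t)_{j,k} + \delta_{ik}(R_t)_{j,l},
\qquad
\langle M_{i,j}, N \rangle_t = 2\delta_{ij}\,t \quad \text{and} \quad \langle N \rangle_t = Q_t^{-1},
\tag{27}
\]

where $\delta_{ij}$ stands for the Kronecker symbol.

So, it follows from the central limit theorem for martingales (see e.g., Kutoyants [23],
Proposition 1.21), that $\left( \frac{M_T}{\sqrt{T}}, \frac{N_T}{\sqrt{T}} \right)$ converges in law under $\mathbb{P}_\theta$ towards a centered Gaussian
vector $(\tilde G, \tilde H)$ taking values in $\mathcal{S}_d \times \mathbb{R}$ such that

\[
\mathbb{E}_\theta(\tilde G_{i,j} \tilde G_{k,l}) = \delta_{jl}(R_\infty)_{i,k} + \delta_{jk}(R_\infty)_{i,l} + \delta_{il}(R_\infty)_{j,k} + \delta_{ik}(R_\infty)_{j,l},
\qquad
\mathbb{E}_\theta(\tilde G_{i,j} \tilde H) = 2\delta_{ij} \quad \text{and} \quad \mathbb{E}_\theta(\tilde H^2) = Q_\infty^{-1}.
\tag{28}
\]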





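From (25) and (21), we obtain (22). From Lemma B.1, the function $(X, Y, a) \mapsto \mathcal{L}^{-1}_{X,a}(Y)$ is
continuous, and we get by Slutsky's theorem that $(\sqrt{T}(\hat b_T - b), \sqrt{T}(\hat\alpha_T - \alpha))$ converges in
law to the Gaussian vector

\[
(G, H) = \left( \mathcal{L}^{-1}_{R_\infty, Q_\infty}\left( \tilde G - 2 Q_\infty \tilde H I_d \right), \; 2 Q_\infty \left[ \tilde H - \mathrm{Tr}\left[ \mathcal{L}^{-1}_{R_\infty, Q_\infty}\left( \tilde G - 2 Q_\infty \tilde H I_d \right) \right] \right] \right).
\]

We want to calculate the Laplace transform of this law. First, we calculate the
Laplace transform of $(\tilde G, \tilde H)$:

\[
\forall c \in \mathcal{S}_d, \lambda \in \mathbb{R}, \quad \mathbb{E}_\theta\left[ \exp\left( \mathrm{Tr}[c \tilde G] + \lambda \tilde H \right) \right] = \exp\left( \frac12 \left( \frac{\lambda^2}{Q_\infty} + 4\lambda\,\mathrm{Tr}[c] + 4\,\mathrm{Tr}[c^2 R_\infty] \right) \right).
\tag{29}
\]

We want to calculate for $c \in \mathcal{S}_d$ and $\lambda \in \mathbb{R}$,

\[
\mathbb{E}_\theta\left[ \exp\left( \mathrm{Tr}[cG] + \lambda H \right) \right] = \mathbb{E}_\theta\left[ \exp\left( \mathrm{Tr}[(c - 2\lambda Q_\infty I_d) G] + 2\lambda Q_\infty \tilde H \right) \right].
\]

Due to (20) and Lemma B.1, we can introduce $\tilde c = \mathcal{L}^{-1}_{R_\infty, Q_\infty}(c - 2\lambda Q_\infty I_d)$. We have

\[
R_\infty \tilde c + \tilde c R_\infty - 2 Q_\infty \mathrm{Tr}[\tilde c] I_d = c - 2\lambda Q_\infty I_d,
\]

and thus

\[
\begin{aligned}
\mathrm{Tr}[(c - 2\lambda Q_\infty I_d) G] &= \mathrm{Tr}[(R_\infty \tilde c + \tilde c R_\infty - 2 Q_\infty \mathrm{Tr}[\tilde c] I_d) G] \\
&= \mathrm{Tr}[\tilde c (R_\infty G + G R_\infty - 2 Q_\infty \mathrm{Tr}[G] I_d)] = \mathrm{Tr}[\tilde c (\tilde G - 2 Q_\infty \tilde H I_d)].
\end{aligned}
\]

We therefore obtain from (29)

\[
\begin{aligned}
\mathbb{E}_\theta\left[ \exp\left( \mathrm{Tr}[cG] + \lambda H \right) \right]
&= \mathbb{E}_\theta\left[ \exp\left( \mathrm{Tr}[\tilde c (\tilde G - 2 Q_\infty \tilde H I_d)] + 2\lambda Q_\infty \tilde H \right) \right] \\
&= \mathbb{E}_\theta\left[ \exp\left( \mathrm{Tr}[\tilde c \tilde G] + 2 Q_\infty (\lambda - \mathrm{Tr}[\tilde c]) \tilde H \right) \right] \\
&= \exp\left( 2\left\{ (\lambda - \mathrm{Tr}[\tilde c])^2 Q_\infty + 2(\lambda - \mathrm{Tr}[\tilde c])\,\mathrm{Tr}[\tilde c] Q_\infty + \mathrm{Tr}[\tilde c^2 R_\infty] \right\} \right).
\end{aligned}
\]

Since $2\,\mathrm{Tr}[\tilde c^2 R_\infty] = \mathrm{Tr}[\tilde c(\tilde c R_\infty + R_\infty \tilde c)] = \mathrm{Tr}[\tilde c c] + 2 Q_\infty (\mathrm{Tr}[\tilde c] - \lambda)\,\mathrm{Tr}[\tilde c]$, we get

\[
\mathbb{E}_\theta\left[ \exp\left( \mathrm{Tr}[cG] + \lambda H \right) \right] = \exp\left( 2\lambda(\lambda - \mathrm{Tr}[\tilde c]) Q_\infty + \mathrm{Tr}[\tilde c c] \right).
\]

We now use that $\mathcal{L}^{-1}_{R_\infty, Q_\infty}(I_d) = \dfrac{R_\infty^{-1}}{2(1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}])}$ to get $\tilde c = \mathcal{L}^{-1}_{R_\infty, Q_\infty}(c) - \lambda \dfrac{Q_\infty R_\infty^{-1}}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]}$.
Since we have $\mathrm{Tr}[\mathcal{L}^{-1}_{R_\infty, Q_\infty}(c)] = \dfrac{\mathrm{Tr}[R_\infty^{-1} c]}{2(1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}])}$ by Lemma B.1, this yields the claimed
result. □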


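When $\alpha = d+1$, the rate of convergence of the MLE of $\alpha$ is even better, as stated by the
following theorem.

Theorem 2.2. Assume $-b \in \mathcal{S}_d^{+,*}$ and $\alpha = d+1$. Then, under $\mathbb{P}_\theta$, $\left( \sqrt{T}(\hat b_T - b), T(\hat\alpha_T - \alpha) \right)$
converges in law when $T \to +\infty$ to $\left( G, \dfrac{-2\,\mathrm{Tr}[b]}{\tau_{-\mathrm{Tr}[b]}} \right)$, where $\tau_a = \inf\{t \ge 0, B_t = a\}$ with
$(B_t)_{t \ge 0}$ a given one-dimensional standard Brownian motion and $G$ is a Gaussian vector
independent of $B$ such that $\mathbb{E}_\theta[\exp(\mathrm{Tr}[cG])] = \exp\left( \mathrm{Tr}[c\,\mathcal{L}^{-1}_{R_\infty}(c)] \right)$, $c \in \mathcal{S}_d$.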
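Proof. By (14) and Lemma B.1, we can rewrite the system (26) as follows

\[
\begin{cases}
T(\hat\alpha_T - \alpha) = 2 T^2 Q_T \left( \dfrac{N_T}{T} - \dfrac{1}{\sqrt{T}} \mathrm{Tr}\left[ \mathcal{L}^{-1}_{\frac{R_T}{T}, T Q_T}\left( \dfrac{M_T}{\sqrt{T}} - \dfrac{2 T^2 Q_T}{\sqrt{T}} \dfrac{N_T}{T} I_d \right) \right] \right), \\[2ex]
\sqrt{T}(\hat b_T - b) = \mathcal{L}^{-1}_{\frac{R_T}{T}, T Q_T}\left( \dfrac{M_T}{\sqrt{T}} - \dfrac{2 T^2 Q_T}{\sqrt{T}} \dfrac{N_T}{T} I_d \right).
\end{cases}
\tag{30}
\]

From (25), we have

\[
\frac{N_T}{T} = \frac{1}{2T} \log\left( \frac{\det[X_T]}{\det[x]} \right) - \mathrm{Tr}[b].
\]

As for $-b \in \mathcal{S}_d^{+,*}$ the Wishart process $(X_t)_{t \ge 0}$ is stationary with invariant limit distribution
$X_\infty$, we easily deduce that $\frac{N_T}{T}$ converges in probability to $-\mathrm{Tr}[b]$ when $T \to \infty$. Then, it
follows from (19) that

\[
(T^{-1} R_T, T^{-1} N_T) \xrightarrow{\mathbb{P}} (R_\infty, -\mathrm{Tr}[b]), \quad \text{as } T \to \infty.
\tag{31}
\]

Hence, we only need to study the asymptotic behavior of the couple $(T^{-1/2} M_T, T^2 Q_T)$.
According to Theorem 4.1 in Mayerhofer [29], we have for $\lambda \ge 0$ and $\Gamma \in \mathcal{S}_d$

\[
\mathbb{E}_\theta\left[ \exp\left( \frac{\lambda}{T} N_T - \frac{\lambda^2}{2T^2} Q_T^{-1} + \frac{1}{\sqrt{T}} \mathrm{Tr}[\Gamma M_T] - \frac{2}{T} \int_0^T \mathrm{Tr}[\Gamma^2 X_s]\,ds - \frac{2\lambda}{\sqrt{T}} \mathrm{Tr}[\Gamma] \right) \right] = 1.
\tag{32}
\]

Now, let us introduce the quantity

\[
A_T = \mathbb{E}_\theta\left[ \exp\left( \frac{\lambda}{T} N_T - \frac{\lambda^2}{2T^2} Q_T^{-1} + \lambda\,\mathrm{Tr}[b] \right) \exp\left( \frac{1}{\sqrt{T}} \mathrm{Tr}[\Gamma M_T] \right) \exp\left( -\frac{2}{T} \int_0^T \mathrm{Tr}[\Gamma^2 X_s]\,ds + 2\,\mathrm{Tr}[\Gamma^2 R_\infty] \right) \right].
\]

Then, by (32) we easily get $A_T = \exp\left( \lambda\,\mathrm{Tr}[b] + 2\,\mathrm{Tr}[\Gamma^2 R_\infty] + \frac{2\lambda}{\sqrt{T}} \mathrm{Tr}[\Gamma] \right)$. We now write

\[
A_T = \tilde A_T + \mathbb{E}_\theta\left[ \exp\left( -\frac{\lambda^2}{2T^2} Q_T^{-1} + \frac{1}{\sqrt{T}} \mathrm{Tr}[\Gamma M_T] \right) \right],
\]

with

\[
\tilde A_T = \mathbb{E}_\theta\left[ \left( \exp(\xi_T) - 1 \right) \exp\left( -\frac{\lambda^2}{2T^2} Q_T^{-1} + \frac{1}{\sqrt{T}} \mathrm{Tr}[\Gamma M_T] \right) \right],
\qquad
\xi_T = \frac{\lambda}{T} N_T + \lambda\,\mathrm{Tr}[b] - \frac{2}{T} \int_0^T \mathrm{Tr}[\Gamma^2 X_s]\,ds + 2\,\mathrm{Tr}[\Gamma^2 R_\infty].
\]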

Cauchy-Schwarz inequality and > 0 give 

|Ar| ^ Ey^[exp (2^r) - 2exp (^t) + IjE^^ 

On the one hand. Proposition 5.1 with m = —b £ St’* gives 


exp ( -^Ty[TMt] 


Eg 


exp ( ^ Tr[rMr] 


^ Eg 


exp( ^Tr[r2i?r] 


< oo. 
On the other hand, we have for any r ^ 0 
Eg [exp (r^r)] ^ Eg 
From (25), we have 


I AT 

exp ( 


ex 


p{2rTv[T^R^]). 


Eg 


f AT 

exp ( 


= exp(—Ar Tr[6])E6i 


'/ det[XT] \ 
\ det[x] / 


Ar 

2T 


The sublinear growth of the coefficients of the Wishart SDE and the convergence to a sta- 


tionary law gives that E^ 


det [ai] J 


is uniformly bounded in T > 0, A < 1 and there¬ 


fore sup 2 ,^>^ Ef) ^ < oo. This gives the uniform integrability of the family 

(exp (2^r) ,T > A). Then, we deduce from (31) that E6i[exp (2^7^) — 2exp(^'r) -|- 1] —)■ 0 

T^+oo 

and thus At —)■ 0. 

T^+oo 

Hence, we obtain 


lim Eq 

T^OO 


exp 


-Nq-^ + 


^ = ^lim At = exp (ATr[6] -|- 2Tr[r^i?oo]) • 


Therefore, we deduce by Lemma B.4 the following convergence in law 


Qt^ Mt 

^’Vf 


- Tr[f,] : 


R^G + G'^X Rc 


as T —)■ oo. 


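where $\tilde G_{i,j}$, $1 \le i,j \le d$, are independent standard normal variables. Together with (31), we
obtain that

\[
(T^{-1} R_T, T^2 Q_T, T^{-1} N_T, T^{-1/2} M_T) \Rightarrow \left( R_\infty, 1/\tau_{-\mathrm{Tr}[b]}, -\mathrm{Tr}[b], \sqrt{R_\infty}\,\tilde G + \tilde G^\top \sqrt{R_\infty} \right),
\tag{33}
\]

which gives the claim by (30) and Lemma B.4. □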


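2.2 The global MLE of $\theta = (b, \alpha)$ when $b \in \mathcal{M}_d$

We define the linear operators $\tilde{\mathcal{L}}_X, \tilde{\mathcal{L}}_{X,a} : \mathcal{M}_d \to \mathcal{M}_d$ by

\[
\tilde{\mathcal{L}}_X(Y) = \mathcal{L}_X^{-1}(YX + XY^\top) X, \qquad \tilde{\mathcal{L}}_{X,a}(Y) = \tilde{\mathcal{L}}_X(Y) - a\,\mathrm{Tr}[Y] I_d.
\]

From (4), we get $Z_T = (\alpha - 1 - d) Q_T^{-1} + 2\,\mathrm{Tr}[b] T + 2 N_T$. Together with (10), this yields

\[
\begin{cases}
\hat\alpha_T - \alpha = 2 T Q_T\,\mathrm{Tr}[b - \hat b_T] + 2 Q_T N_T, \\[1ex]
\displaystyle\int_0^T \tilde{\mathcal{L}}_{X_s}(\hat b_T - b)\,ds - T^2 Q_T\,\mathrm{Tr}[\hat b_T - b] I_d = \int_0^T \mathcal{L}_{X_s}^{-1}(dM_s) X_s - T Q_T N_T I_d.
\end{cases}
\tag{34}
\]

We now define

\[
\mathcal{L}_T(Y) = \frac{1}{T} \int_0^T \tilde{\mathcal{L}}_{X_s}(Y)\,ds - T Q_T\,\mathrm{Tr}[Y] I_d,
\]

which is a linear operator on $\mathcal{M}_d$. By using the convexity of the inverse function, there
exists $\gamma \in (0,1)$ that depends on $(X_s, s \in [0,T])$ such that $T Q_T = \gamma \frac{1}{T} \int_0^T \frac{ds}{\mathrm{Tr}[X_s^{-1}]}$. We get
$\mathcal{L}_T(Y) = \frac{1}{T} \int_0^T \tilde{\mathcal{L}}_{X_s, \gamma/\mathrm{Tr}[X_s^{-1}]}(Y)\,ds$. By Lemma B.3, $\mathcal{L}_T$ is self-adjoint and positive. It is even
positive definite since $\mathrm{Tr}[\mathcal{L}_T(Y)^\top Y] = 0$ implies by using Lemma B.3 that $Y X_s + X_s Y^\top = 0$
a.s. on $[0,T]$ under $\mathbb{P}_\theta$, and therefore the quadratic variation of $\mathrm{Tr}[u Y X_s]$ is equal to zero for
any $u \in \mathcal{S}_d$. This gives $\int_0^T \mathrm{Tr}[(uY + Y^\top u) X_s (uY + Y^\top u)]\,ds = 0$ and thus $uY + Y^\top u = 0$ for
all $u \in \mathcal{S}_d$, which necessarily implies $Y = 0$. Thus, we rewrite (34) as

\[
\begin{cases}
\sqrt{T}(\hat\alpha_T - \alpha) = -2 T Q_T\,\mathrm{Tr}[\sqrt{T}(\hat b_T - b)] + 2 T Q_T \dfrac{N_T}{\sqrt{T}}, \\[2ex]
\sqrt{T}(\hat b_T - b) = \mathcal{L}_T^{-1}\left( \dfrac{1}{\sqrt{T}} \displaystyle\int_0^T \mathcal{L}_{X_s}^{-1}(dM_s) X_s - T Q_T \dfrac{N_T}{\sqrt{T}} I_d \right).
\end{cases}
\tag{35}
\]

We will assume $-(b + b^\top) \in \mathcal{S}_d^{+,*}$ and know from Lemma C.1 that $X_t$ converges in law under
$\mathbb{P}_\theta$ to the stationary law $X_\infty \sim WIS_d(0, \alpha, 0, \sqrt{2 q_\infty}; 1/2)$. We define

\[
\mathcal{L}_\infty(Y) = \mathbb{E}_\theta[\tilde{\mathcal{L}}_{X_\infty}(Y)] - Q_\infty \mathrm{Tr}[Y] I_d.
\tag{36}
\]

Note that for $Y \in \mathcal{S}_d$, $\mathcal{L}_\infty(Y) = Y R_\infty - Q_\infty \mathrm{Tr}[Y] I_d$. From the convexity of the inverse function,
$Q_\infty = \gamma\,\mathbb{E}_\theta[1/\mathrm{Tr}[X_\infty^{-1}]]$ with $\gamma \in (0,1)$, and thus $\mathcal{L}_\infty(Y) = \mathbb{E}_\theta[\tilde{\mathcal{L}}_{X_\infty, \gamma/\mathrm{Tr}[X_\infty^{-1}]}(Y)]$ is a self-adjoint
positive operator by Lemma B.3. It is even positive definite since $\mathrm{Tr}[\mathcal{L}_\infty(Y)^\top Y] = 0$
implies by Lemma B.3 that $Y X_\infty + X_\infty Y^\top = 0$ almost surely. Since the law of $X_\infty$ has a
positive density on $\mathcal{S}_d^{+,*}$, this gives $Yu + uY^\top = 0$ for any $u \in \mathcal{S}_d$ and thus $Y = 0$.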

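Theorem 2.3. Assume $-(b + b^\top) \in \mathcal{S}_d^{+,*}$ and $\alpha > d+1$. Then, under $\mathbb{P}_\theta$, $\left( \sqrt{T}(\hat b_T - b), \sqrt{T}(\hat\alpha_T - \alpha) \right)$
converges in law when $T \to +\infty$ to the centered Gaussian vector $(G, H)$ that takes values in
$\mathcal{M}_d \times \mathbb{R}$ and has the following Laplace transform: for $(c, \lambda) \in \mathcal{M}_d \times \mathbb{R}$,

\[
\begin{aligned}
\mathbb{E}_\theta[\exp(\mathrm{Tr}[c^\top G] + \lambda H)] = \exp\Bigg( & \frac{2 Q_\infty \lambda^2}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} - 2\lambda Q_\infty \frac{\mathrm{Tr}[c R_\infty^{-1}]}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} \\
& + \frac14 \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}\left( \mathcal{L}_\infty^{-1}(c) X_\infty + X_\infty \mathcal{L}_\infty^{-1}(c)^\top \right) \left( \mathcal{L}_\infty^{-1}(c) X_\infty + X_\infty \mathcal{L}_\infty^{-1}(c)^\top \right) \right] \right] \\
& + \frac{Q_\infty \mathrm{Tr}[c R_\infty^{-1}]}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} \left( \frac12 \frac{\mathrm{Tr}[c R_\infty^{-1}]}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} - \mathrm{Tr}[\mathcal{L}_\infty^{-1}(c)] \right) \Bigg).
\end{aligned}
\tag{37}
\]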


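Proof. From Birkhoff's ergodic theorem, $\mathcal{L}_T$ converges almost surely to $\mathcal{L}_\infty$, and thus
$\mathcal{L}_T^{-1}$ converges almost surely to $\mathcal{L}_\infty^{-1}$. We define the martingale $\tilde M_t = \int_0^t \mathcal{L}_{X_s}^{-1}(dM_s) X_s$. We
have

\[
d(\tilde M_s)_{i,j} = \mathrm{Tr}\left[ e_{i,j}^\top \mathcal{L}_{X_s}^{-1}(dM_s) X_s \right] = \frac12 \mathrm{Tr}\left[ \mathcal{L}_{X_s}^{-1}(e_{i,j} X_s + X_s e_{i,j}^\top)\,dM_s \right]
= \frac12 \sum_{1 \le k,l \le d} \left( \mathcal{L}_{X_s}^{-1}(e_{i,j} X_s + X_s e_{i,j}^\top) \right)_{k,l} (dM_s)_{k,l}.
\]

We get from (27) that

\[
\begin{aligned}
\langle d(\tilde M_s)_{i,j}, d(\tilde M_s)_{i',j'} \rangle &= \frac12 \mathrm{Tr}\left[ \mathcal{L}_{X_s}^{-1}(e_{i,j} X_s + X_s e_{i,j}^\top)(e_{i',j'} X_s + X_s e_{i',j'}^\top) \right] ds, \\
\langle d(\tilde M_s)_{i,j}, dN_s \rangle &= \mathrm{Tr}\left[ \mathcal{L}_{X_s}^{-1}(e_{i,j} X_s + X_s e_{i,j}^\top) \right] ds = \delta_{ij}\,ds.
\end{aligned}
\]

From the central limit theorem for martingales, $\left( \frac{\tilde M_T}{\sqrt{T}}, \frac{N_T}{\sqrt{T}} \right)$ converges in law under $\mathbb{P}_\theta$ towards
a centered Gaussian vector $(\tilde G, \tilde H)$ taking values in $\mathcal{M}_d \times \mathbb{R}$ such that

\[
\mathbb{E}_\theta(\tilde G_{i,j} \tilde G_{i',j'}) = \frac12 \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}(e_{i,j} X_\infty + X_\infty e_{i,j}^\top)(e_{i',j'} X_\infty + X_\infty e_{i',j'}^\top) \right] \right],
\qquad
\mathbb{E}_\theta(\tilde G_{i,j} \tilde H) = \delta_{ij} \quad \text{and} \quad \mathbb{E}_\theta(\tilde H^2) = Q_\infty^{-1}.
\tag{38}
\]

We thus have the following Laplace transform for $c \in \mathcal{M}_d$, $\lambda \in \mathbb{R}$,

\[
\mathbb{E}_\theta[\exp(\mathrm{Tr}[c^\top \tilde G] + \lambda \tilde H)] = \exp\left( \frac14 \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}(c X_\infty + X_\infty c^\top)(c X_\infty + X_\infty c^\top) \right] \right] + \lambda\,\mathrm{Tr}[c] + \frac12 \lambda^2 Q_\infty^{-1} \right).
\tag{39}
\]

By using Slutsky's theorem, we get from (35) that $\sqrt{T}(\hat b_T - b, \hat\alpha_T - \alpha)$ converges in law
under $\mathbb{P}_\theta$ when $T \to +\infty$ to the centered Gaussian vector

\[
(G, H) = \left( \mathcal{L}_\infty^{-1}(\tilde G - Q_\infty \tilde H I_d), \; 2 Q_\infty \left( \tilde H - \mathrm{Tr}\left[ \mathcal{L}_\infty^{-1}(\tilde G - Q_\infty \tilde H I_d) \right] \right) \right).
\]

Now, we use that $\mathcal{L}_\infty^{-1}(I_d) = \dfrac{1}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} R_\infty^{-1}$ and that $\mathcal{L}_\infty$ is self-adjoint to get for $c \in \mathcal{M}_d$

\[
\mathrm{Tr}[c^\top G] = \mathrm{Tr}[\mathcal{L}_\infty^{-1}(c)^\top \tilde G] - \frac{Q_\infty \tilde H\,\mathrm{Tr}[c^\top R_\infty^{-1}]}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]},
\qquad
H = \frac{2 Q_\infty}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} \left( \tilde H - \mathrm{Tr}[R_\infty^{-1} \tilde G] \right).
\]

From (39), we obtain (37) after some calculations, using in particular that for $m \in \mathcal{M}_d$, $s \in \mathcal{S}_d$,

\[
\begin{aligned}
& \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}((m+s) X_\infty + X_\infty (m+s)^\top)((m+s) X_\infty + X_\infty (m+s)^\top) \right] \right] \\
& \quad = \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}(m X_\infty + X_\infty m^\top)(m X_\infty + X_\infty m^\top) \right] \right] + 2\,\mathrm{Tr}[s(m R_\infty + R_\infty m^\top)] + 2\,\mathrm{Tr}[s^2 R_\infty],
\end{aligned}
\tag{40}
\]

and taking $m = \mathcal{L}_\infty^{-1}(c)$ and $s = -\lambda \dfrac{2 Q_\infty}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} R_\infty^{-1}$. □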


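Remark 2.1. It is interesting to compare Theorems 2.1 and 2.3 and see that the asymptotic
variance of $\sqrt{T}(\hat\alpha_T - \alpha)$ is the same in both cases. In contrast, for the estimation of the symmetric
part of $b$, we can check that the asymptotic variance is greater when we do not know a priori
that $b$ is symmetric. For $c \in \mathcal{S}_d$, we have $c = \frac12 [\mathcal{L}_\infty^{-1}(c) R_\infty + R_\infty \mathcal{L}_\infty^{-1}(c)^\top] - Q_\infty \mathrm{Tr}[\mathcal{L}_\infty^{-1}(c)] I_d$
and $c = \mathcal{L}^{-1}_{R_\infty, Q_\infty}(c) R_\infty + R_\infty \mathcal{L}^{-1}_{R_\infty, Q_\infty}(c) - 2 Q_\infty \mathrm{Tr}[\mathcal{L}^{-1}_{R_\infty, Q_\infty}(c)] I_d$. Multiplying by $R_\infty^{-1}$ and taking the trace, we
get

\[
\mathrm{Tr}[\mathcal{L}_\infty^{-1}(c)] = \frac{\mathrm{Tr}[c R_\infty^{-1}]}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} = 2\,\mathrm{Tr}[\mathcal{L}^{-1}_{R_\infty, Q_\infty}(c)],
\]

and then $\mathcal{L}_\infty^{-1}(c) = 2 \mathcal{L}^{-1}_{R_\infty, Q_\infty}(c) + A$ with $A R_\infty + R_\infty A^\top = 0$. This gives from (40)

\[
\begin{aligned}
& \frac14 \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}\left( \mathcal{L}_\infty^{-1}(c) X_\infty + X_\infty \mathcal{L}_\infty^{-1}(c)^\top \right) \left( \mathcal{L}_\infty^{-1}(c) X_\infty + X_\infty \mathcal{L}_\infty^{-1}(c)^\top \right) \right] \right] \\
& \qquad + \frac{Q_\infty \mathrm{Tr}[c R_\infty^{-1}]}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} \left( \frac12 \frac{\mathrm{Tr}[c R_\infty^{-1}]}{1 - Q_\infty \mathrm{Tr}[R_\infty^{-1}]} - \mathrm{Tr}[\mathcal{L}_\infty^{-1}(c)] \right) \\
& \quad = 2\,\mathrm{Tr}[\mathcal{L}^{-1}_{R_\infty, Q_\infty}(c)^2 R_\infty] + \frac14 \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}(A X_\infty + X_\infty A^\top)(A X_\infty + X_\infty A^\top) \right] \right] - 2 Q_\infty \mathrm{Tr}[\mathcal{L}^{-1}_{R_\infty, Q_\infty}(c)]^2 \\
& \quad = \mathrm{Tr}[\mathcal{L}^{-1}_{R_\infty, Q_\infty}(c)\,c] + \frac14 \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}(A X_\infty + X_\infty A^\top)(A X_\infty + X_\infty A^\top) \right] \right] \ge \mathrm{Tr}[\mathcal{L}^{-1}_{R_\infty, Q_\infty}(c)\,c],
\end{aligned}
\]

since $\mathcal{L}_{X_\infty}^{-1}$ is a self-adjoint positive operator.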
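2.3 The MLE of $b$

When $\alpha \in [d-1, d+1)$, we are no longer able to study the convergence of the MLE of $\alpha$. It
is however still possible to get the speed of convergence of the MLE of $b$.

Theorem 2.4. Assume that $-b \in \mathcal{S}_d^{+,*}$ and $\alpha \ge d-1$. For $T > 0$, we consider $\hat b_T$ defined
by (18). Then, under $\mathbb{P}_\theta$, $\sqrt{T}(\hat b_T - b)$ converges in law to a centered Gaussian vector $G$ on
$\mathcal{S}_d$ with the following Laplace transform: $\mathbb{E}_\theta[\exp(\mathrm{Tr}[cG])] = \exp\left( \mathrm{Tr}[c\,\mathcal{L}^{-1}_{R_\infty}(c)] \right)$, $c \in \mathcal{S}_d$.

Assume that $-(b + b^\top) \in \mathcal{S}_d^{+,*}$ and $\alpha \ge d-1$. For $T > 0$, we consider $\hat b_T$ defined by (16).
Then, under $\mathbb{P}_\theta$, $\sqrt{T}(\hat b_T - b)$ converges in law to a centered Gaussian vector $G$ on $\mathcal{M}_d$ with
the following Laplace transform: for $c \in \mathcal{M}_d$,

\[
\mathbb{E}_\theta[\exp(\mathrm{Tr}[c^\top G])] = \exp\left( \frac14 \mathbb{E}_\theta\left[ \mathrm{Tr}\left[ \mathcal{L}_{X_\infty}^{-1}\left( \bar{\mathcal{L}}_\infty^{-1}(c) X_\infty + X_\infty \bar{\mathcal{L}}_\infty^{-1}(c)^\top \right) \left( \bar{\mathcal{L}}_\infty^{-1}(c) X_\infty + X_\infty \bar{\mathcal{L}}_\infty^{-1}(c)^\top \right) \right] \right] \right),
\]

with $\bar{\mathcal{L}}_\infty(c) = \mathbb{E}_\theta[\tilde{\mathcal{L}}_{X_\infty}(c)]$.

Proof. We could prove the result for (18) by using the explicit Laplace transform of Proposition 5.1.
Here, we use the same arguments as before based on the ergodic property. From (18),
we have

\[
\sqrt{T}(\hat b_T - b) = \mathcal{L}^{-1}_{\frac{R_T}{T}}\left( \frac{X_T - x - b R_T - R_T b - \alpha T I_d}{\sqrt{T}} \right) = \mathcal{L}^{-1}_{\frac{R_T}{T}}\left( \frac{M_T}{\sqrt{T}} \right).
\]

As in the proof of Theorem 2.1, $\frac{M_T}{\sqrt{T}}$ converges in law to the centered Gaussian vector $\tilde G$
defined by (28). Slutsky's theorem and (19) then give the convergence of $\sqrt{T}(\hat b_T - b)$ to
$G = \mathcal{L}^{-1}_{R_\infty}(\tilde G)$, whose Laplace transform is given by Lemma B.4.

Similarly, we get from (16) that $\sqrt{T}(\hat b_T - b) = \bar{\mathcal{L}}_T^{-1}(\tilde M_T/\sqrt{T})$ with $\bar{\mathcal{L}}_T(Y) = \frac{1}{T} \int_0^T \tilde{\mathcal{L}}_{X_s}(Y)\,ds$.
We easily check that $\mathrm{Tr}[\bar{\mathcal{L}}_T(Y)^\top Y] \ge \mathrm{Tr}[\mathcal{L}_T(Y)^\top Y]$ and $\mathrm{Tr}[\bar{\mathcal{L}}_\infty(Y)^\top Y] \ge \mathrm{Tr}[\mathcal{L}_\infty(Y)^\top Y]$ for
$Y \in \mathcal{M}_d$. Therefore, $\bar{\mathcal{L}}_T$ and $\bar{\mathcal{L}}_\infty$ are positive definite and thus invertible. Besides, the
ergodic theorem gives that $\bar{\mathcal{L}}_T^{-1}$ converges almost surely to $\bar{\mathcal{L}}_\infty^{-1}$. The result follows from (39)
and the self-adjoint property of $\bar{\mathcal{L}}_\infty$. □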

3 Statistical Inference of the Wishart process: some nonergodic cases

This section studies the convergence of the MLE in the case $b = b_0 I_d$ with $b_0 \ge 0$. When
$b_0 = 0$ and $\alpha \ge d+1$, we are able to describe the rate of convergence of the MLE of $(b, \alpha)$
given by (15), when $b$ is known to be symmetric. When $b_0 > 0$ and $\alpha \ge d-1$, we can
also obtain the rate of convergence of the MLE of $b$ given by (18). Last, when $b$ is known
a priori to be diagonal, the MLE of $b$ has a simpler form and we can describe precisely its
convergence.

3.1 The global MLE of $\theta = (b, \alpha)$ when $b = 0$

The following result provides the asymptotic behavior of the estimator of the couple when
$\alpha > d+1$ and $b = 0$ in (4).
Theorem 3.1. Assume that $b = 0$ and $\alpha > d+1$. Let $(\hat b_T, \hat\alpha_T)$ be the MLE defined by (15).
Then, $(T(\hat b_T - b), \sqrt{\log(T)}(\hat\alpha_T - \alpha))$ converges in law under $\mathbb{P}_\theta$ when $T \to +\infty$ to

\[
\left( \mathcal{L}^{-1}_{R_1^0}\left( X_1^0 - \alpha I_d \right), \; 2\sqrt{\frac{\alpha - (d+1)}{d}}\,G \right),
\]

where $X_t^0 = \alpha t I_d + \int_0^t \sqrt{X_s^0}\,dW_s + dW_s^\top \sqrt{X_s^0}$ is a Wishart process with the same parameters
but starting from 0, $R_t^0 = \int_0^t X_s^0\,ds$ and $G \sim \mathcal{N}(0,1)$ is an independent standard Normal
variable.

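Proof. From (15) and (26), we obtain

\[
\begin{cases}
\sqrt{\log(T)}(\hat\alpha_T - \alpha) = -2\,\mathrm{Tr}[T \hat b_T] \dfrac{\log(T) Q_T}{\sqrt{\log(T)}} + 2 \log(T) Q_T \dfrac{N_T}{\sqrt{\log(T)}}, \\[2ex]
T \hat b_T = \mathcal{L}^{-1}_{\frac{R_T}{T^2}, Q_T}\left( \dfrac{X_T}{T} - \dfrac{x}{T} - (Q_T Z_T + 1 + d) I_d \right),
\end{cases}
\]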

and we are interested in studying the convergence in law of 
rem 4.1 in [29], for p, 0 and T > 1, 

,,2 


N'T J’Ct Rt 


. By Theo- 


dF 

dFe 


= exp 


y/log(7^ 2gTlog(T)) 


pNj' 


defines a change of probability and (^t)tg[o,r] is a Wishart process with degree a + ^ 


under P. Let Ai,A 2 G and 


Aj' = Kg 


exp 


pNx 




2gTiog(T)^ 

By Proposition 5.1, we have 


Ax = K 


exp ( — Tr 
1 


T 


-Tr 


exp — Tr 




T 


-Tr 




a + iJ./yiog(T) 

det]!/] 2 


exp(-—Tr[FV 




T^.+oo det[y] 2 


where 


(41) 


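\[
V = \left(\sqrt{2\lambda_1}\right)^{-1} \sinh\left(\sqrt{2\lambda_1}\right) 2\lambda_2 + \cosh\left(\sqrt{2\lambda_1}\right), \qquad V' = 2\cosh\left(\sqrt{2\lambda_1}\right)\lambda_2 + \sqrt{2\lambda_1} \sinh\left(\sqrt{2\lambda_1}\right).
\tag{42}
\]

We note that this limit does not depend on $\mu$ and is the Laplace transform of $(X_1^0, R_1^0)$ by
Proposition 5.1.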

We now use that 


^ -7- —TVTTT cL.s., see Lemma C.2 and we define 

Qt log(r) a-(d+l) ’ 


Ax = Kg [exp(^'r)] and ^x = 


p^d 


pNx 

TMy ~ 2(a-(d+l)) 


-Tr 


T 


-Tr 
that is finite by using equation (73) of Lemma C.2 since ^ We have 


Aj' = Ax + IE(j 


exp 


ji^d 




Xci-{d+l)) 2QT\og{T)) 
The Cauchy-Schwarz inequality gives 


- 1 ^ exp (^t) 


{Ax — Ax)^ ^ Eg 


exp 


^Pd 




2 (a-(d+l)) 2QTlog(r) 


- 1 


Eg [exp {2^x)] 


Since QTlog(T) is positive for T > 1 and converges a.s. to —the first expectation 
goes to 0 while the second one is bounded by using again (73). Therefore, Ax — Ax —t 0, 

r-5>+oo 

and we get 


Eg 


( rp 

exp I — , — ir 


\/iog(r) 


A 2 


Xx 


- Tr 


Ai 

r 2 


Rx 


exp 


fAd 

2{a-{d+l)) 


t-5>+oo det[F]2 


Thus, ( —^3=, converges in law to {■\JR\), where G ~ AA(0,1) is 


, \/log(T) 

independent of X^. From (25), we have 


1 Nx 

QxZx + 1 + d = 2— \og{T)Qx — + a. 






and therefore QxZx + 1 + d converges in probability to a. Slutsky’s theorem gives then the 
following convergence in law: as T —)■ + 00 , 


^ ■, +1 + <:,QT^og(T') 


v^y T' 


)"(/ 


GW»,fl«. a, 


a — (d + 1) 


d 


(43) 

This gives the claimed convergence for $(\hat\alpha_T, \hat b_T)$ due to the continuity property given in
Lemma B.1. □

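Theorem 3.2. Assume that $b = 0$ and $\alpha = d+1$. Let $(\hat b_T, \hat\alpha_T)$ be the MLE defined by (15).
Then, $(T(\hat b_T - b), \log(T)(\hat\alpha_T - \alpha))$ converges in law under $\mathbb{P}_\theta$ when $T \to +\infty$ to

\[
\left( \mathcal{L}^{-1}_{R_1^0}\left( X_1^0 - \alpha I_d \right), \; \frac{4}{d\,\tau_1} \right),
\]

where $X_t^0 = \alpha t I_d + \int_0^t \sqrt{X_s^0}\,dW_s + dW_s^\top \sqrt{X_s^0}$ is a Wishart process with the same parameters
but starting from 0, $R_t^0 = \int_0^t X_s^0\,ds$ and $\tau_1 = \inf\{t \ge 0, B_t = 1\}$ where $B$ is a standard
Brownian motion independent of $W$.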

Proof. The proof follows the same line as the one of Theorem 3.1, but we now write 


log(T)(ci'r — a) = —2 


log(r) 


log(T)^gT + 21og(T)^QT 


log(r) ’ 
while we still have $T \hat b_T = \mathcal{L}^{-1}_{\frac{R_T}{T^2}, Q_T}\left( \frac{X_T}{T} - \frac{x}{T} - (Q_T Z_T + 1 + d) I_d \right)$. By Theorem 4.1 in [29],

for ^ 0 and T > 1, ^ = exp ^ \og(T) ~ 2 Qt log(r)^ ) defines a change of probability, and we 
define for Ai, A 2 G 

,2 


Ax = Kg 


exp 




f 

" 2QTlog(T)2 


exp ( — Tr 


T 


- Tr 




By Proposition 5.1, we have 

1 


1 


At = - :rTi — 7¥V (-Tr \V'V ^ 

det[y]- 2 - ^ 


x\ 


r-5>+oo det]!/] 2 


where V and V' are defined by (42). 

We now use that i probability, see Lemma C.2, and define 


At = E, [exp(&)l. & = f - 2 QThg{Tr ^ 


A 2 

Y 


Xt 


-Tr 


Ai 

Y 


Rt 


and have 


At = At + Kg 


f uNt fid\ . 


We note that exp(^'r) ^ exp By using Lemma C.2 and the uniform integrability (74), 


we get that At — At —)• 0 and therefore 

T^ + OO 


Kg 


exp 




2QTlog(T)2 


-Tr 


T 


- Tr 




exp{—fj,d/2) 
J t-5>+oo det[l/]t 


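Therefore, $\left( \frac{X_T}{T}, \frac{R_T}{T^2}, Q_T \log(T)^2 \right)$ converges in law to $\left( X_1^0, R_1^0, \left( \frac{2}{d} \right)^2 \frac{1}{\tau_1} \right)$, where $\tau_1$ is independent
of $X^0$. We observe that $Q_T Z_T = \frac{2}{\log(T)}\, Q_T \log(T)^2\, \frac{N_T}{\log(T)}$. Lemma C.2 and Slutsky's
theorem give

\[ \left( \frac{N_T}{\log(T)}, \frac{X_T}{T}, \frac{R_T}{T^2}, Q_T Z_T + 1 + d, Q_T \log(T)^2 \right) \Rightarrow \left( \frac{d}{2}, X_1^0, R_1^0, \alpha, \left( \frac{2}{d} \right)^2 \frac{1}{\tau_1} \right), \tag{44} \]

which gives the claim by using the formulas for $T \hat b_T$ and $\log(T)(\hat\alpha_T - \alpha)$. □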


3.2 The MLE of b 

Until the end of this section we consider that $\alpha \ge d - 1$ is known and study the speed of
convergence of the estimator of $b$ defined by (18).


3.2.1 Case $b = 0$.

Theorem 3.3. Assume that $b = 0$ and $\alpha \ge d - 1$. For $T > 0$, let $\hat b_T$ be defined by (18).
When $T \to +\infty$, $T(\hat b_T - b)$ converges in law under $\mathbb{P}_\theta$ to $\mathcal{L}_{R_1^0}^{-1}(X_1^0 - \alpha I_d)$, where $(X_t^0)_{t \ge 0}$ is
the solution to $X_t^0 = \alpha t I_d + \int_0^t \sqrt{X_s^0}\,dW_s + dW_s^\top \sqrt{X_s^0}$ and $R_1^0 = \int_0^1 X_s^0\,ds$.
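Proof. From (18), we have $T \hat b_T = \mathcal{L}^{-1}_{\frac{R_T}{T^2}}\left( \frac{X_T}{T} - \frac{x}{T} - \alpha I_d \right)$. Let $V$ and $V'$ be defined by (42).
Similarly to (41), we have by Proposition 5.1 for $\lambda_1, \lambda_2 \in \mathcal{S}_d^+$

\[ \mathbb{E}_\theta\left[ \exp\left( -\mathrm{Tr}\left[ \frac{\lambda_2}{T} X_T \right] - \mathrm{Tr}\left[ \frac{\lambda_1}{T^2} R_T \right] \right) \right] = \frac{1}{\det[V]^{\frac{\alpha}{2}}} \exp\left( -\frac{1}{2T} \mathrm{Tr}\left[ V' V^{-1} x \right] \right) \underset{T \to +\infty}{\longrightarrow} \frac{1}{\det[V]^{\frac{\alpha}{2}}}. \]

This gives the convergence in law of $\left( \frac{X_T}{T}, \frac{R_T}{T^2} \right)$ to $(X_1^0, R_1^0)$ and then the claimed result. □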
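The estimator used in Theorem 3.3 requires inverting the map $Y \mapsto R_T Y + Y R_T$. The following is a minimal numerical sketch of that step, assuming $R_T$ is symmetric positive definite and using an eigendecomposition of $R_T$. The function names `solve_sym_lyapunov` and `mle_b_known_alpha` are illustrative and do not come from the paper.

```python
import numpy as np

def solve_sym_lyapunov(R, M):
    """Solve R @ Y + Y @ R = M for Y, with R symmetric positive definite."""
    eigvals, O = np.linalg.eigh(R)          # R = O diag(eigvals) O^T
    M_rot = O.T @ M @ O                     # express M in the eigenbasis of R
    # In that basis the equation decouples: (d_i + d_j) * Y_ij = M_ij.
    Y_rot = M_rot / (eigvals[:, None] + eigvals[None, :])
    return O @ Y_rot @ O.T

def mle_b_known_alpha(X_T, x0, R_T, alpha, T):
    """Sketch of the estimator of b for known alpha: L_{R_T}^{-1}(X_T - x - alpha*T*I_d)."""
    d = X_T.shape[0]
    M = X_T - x0 - alpha * T * np.eye(d)
    return solve_sym_lyapunov(R_T, M)
```

In practice `X_T` and `R_T` would come from an observed or simulated path. The eigenbasis approach avoids forming the $d^2 \times d^2$ linear system explicitly.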


3.2.2 Case $b = b_0 I_d$, $b_0 > 0$.

In this case $b = b_0 I_d$ with $b_0 > 0$. In order to identify the speed of convergence and the limit
law, we use the Laplace transform approach. We have the following result.

Theorem 3.4. Assume that $b = b_0 I_d$, $b_0 > 0$, and $\alpha \ge d - 1$. For $T > 0$, let $\hat b_T$ be defined by
(18). When $T \to +\infty$, $\exp(b_0 T)(\hat b_T - b)$ converges in law under $\mathbb{P}_\theta$ to $\mathcal{L}_X^{-1}\left( \sqrt{X} \tilde G + \tilde G^\top \sqrt{X} \right)$,
where $X \sim WIS_d\left( \frac{x}{2 b_0}, \alpha, 0, I_d; \frac{1}{4 b_0^2} \right)$ and $\tilde G$ is an independent $d$-square matrix whose elements
are independent standard Normal variables.

The proof of this result relies on the explicit calculation of the Laplace transform of
$(X_T, R_T)$ and is postponed to Subsection 5.2.

Obviously, the case $b = b_0 I_d$ is very particular. One would like to consider more general
nonergodic cases, or ideally to state a general convergence result of $\hat b_T$ towards $b$
for any $b \in \mathcal{S}_d$. Despite our efforts, we have not been able to get such a result. The reason
why we can handle the ergodic case and the nonergodic case $b = b_0 I_d$ with $b_0 \ge 0$ is that the
convergence of all the matrix terms occurs at the same speed, namely $1/\sqrt{T}$ for the ergodic
case, $1/T$ for $b = 0$ and $\exp(-b_0 T)$ when $b_0 > 0$. In the other cases, there is no such simple
scalar rescaling. Heuristically, there may be different speeds of convergence that are difficult
to disentangle because of the different matrix products. To illustrate this, we now present
the case of the estimation of $b$ when $b$ is known to be a diagonal matrix. In this case,
we obtain a different speed of convergence for each diagonal term.


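3.2.3 The MLE of $b$ when $b$ is known a priori to be diagonal.

We assume that $\alpha \ge d - 1$ is known and that $b$ is a diagonal matrix, i.e. $b = \mathrm{diag}(b_1, \dots, b_d)$.
We want to estimate the diagonal elements by maximizing the likelihood. We denote $\theta_0 = (0, \alpha)$.
As in (17), we have

\[ L_T^{\theta, \theta_0} = \exp\left( \frac{\mathrm{Tr}[b X_T] - \mathrm{Tr}[b x]}{2} - \frac{1}{2} \int_0^T \mathrm{Tr}[b^2 X_s]\,ds - \frac{\alpha T}{2} \mathrm{Tr}[b] \right). \]

By differentiating this with respect to $b_i$, $1 \le i \le d$, we get

\[ \partial_{b_i} L_T^{\theta, \theta_0} = \frac{1}{2} \left( (X_T)_{i,i} - x_{i,i} - 2 b_i (R_T)_{i,i} - \alpha T \right) L_T^{\theta, \theta_0}, \]

and therefore the MLE of $b$ is given by

\[ (\hat b_T)_i = \frac{(X_T)_{i,i} - x_{i,i} - \alpha T}{2 (R_T)_{i,i}}. \tag{45} \]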
We therefore obtain

\[ (\hat b_T)_i - b_i = \frac{(X_T)_{i,i} - x_{i,i} - \alpha T - 2 b_i (R_T)_{i,i}}{2 (R_T)_{i,i}}. \tag{46} \]

Let us observe that this estimator is precisely the one obtained by Ben Alaya and Kebaier [5]
for the CIR process. This is not very surprising since we know from (4), (2) and $b$ diagonal
that there exist independent Brownian motions $\beta^i$, $1 \le i \le d$, such that

\[ d(X_t)_{i,i} = (\alpha + 2 b_i (X_t)_{i,i})\,dt + 2 \sqrt{(X_t)_{i,i}}\,d\beta_t^i. \]


Thus, the diagonal elements follow independent CIR processes, and the observation of the
non-diagonal elements does not improve the ML estimation. We can obtain the asymptotic
convergence by applying Theorem 1 in [4], up to a small correction in the nonergodic
case which is given by our Theorem 3.4 in dimension $d = 1$. This yields the following
proposition.

Proposition 3.1. Let $\alpha \ge d - 1$ and $b$ a diagonal matrix. Let $\epsilon_t = \mathrm{diag}(\epsilon_t^1, \dots, \epsilon_t^d)$ be a
diagonal matrix with

\[ \epsilon_t^i = \begin{cases} t^{-1/2} & \text{if } b_i < 0, \\ t^{-1} & \text{if } b_i = 0, \\ \exp(-b_i t) & \text{if } b_i > 0. \end{cases} \]

Then, under $\mathbb{P}_\theta$, $\epsilon_T^{-1} \mathrm{diag}((\hat b_T)_1 - b_1, \dots, (\hat b_T)_d - b_d)$ converges in law to a diagonal matrix $D$
made with independent elements. Each diagonal element $D_i$ is distributed as follows:


Vi G {1,' 


.d}, D, = 

law 


tfbi<0 
if bi = 0 
if bi > 0 


X^-a 

2R0 

G 


X 




where Xf = x + at + 2 ^JXfdWg, — fo^^ds, and G ~ AA(0,1) is independent of X. 
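As an illustration of the diagonal estimator (45) and of the rescaling rates in Proposition 3.1, here is a minimal sketch. It assumes the terminal value $X_T$, the initial value $x$ and the integral $R_T$ are available as arrays. The names `mle_b_diagonal` and `rescaling_rate` are hypothetical.

```python
import numpy as np

def mle_b_diagonal(X_T, x0, R_T, alpha, T):
    """Componentwise estimator (45): ((X_T)_ii - x_ii - alpha*T) / (2 (R_T)_ii)."""
    numerator = np.diag(X_T) - np.diag(x0) - alpha * T
    return numerator / (2.0 * np.diag(R_T))

def rescaling_rate(b_i, t):
    """Rate epsilon_t^i of Proposition 3.1 for one diagonal coefficient b_i."""
    if b_i < 0:
        return t ** -0.5        # ergodic component
    if b_i == 0:
        return 1.0 / t          # critical component
    return np.exp(-b_i * t)     # nonergodic component
```

Only the diagonal entries of $X_T$ and $R_T$ are used, which reflects the remark that the off-diagonal observations do not improve the estimation in this setting.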


4 Optimality of the MLE 

In parametric estimation theory, a fundamental role is played by the local asymptotic
normality (LAN) property since the work of Le Cam [24]. This general concept was later
extended by Le Cam and Yang [25] and Jeganathan [22] to the local asymptotic
mixed normality (LAMN) and local asymptotic quadraticity (LAQ) properties. These
notions are mainly dedicated to studying the asymptotic efficiency of estimators of a given
parametric model. The aim of this section is to check the validity of either the LAN, LAMN or
LAQ property for the global model in order to get the asymptotic efficiency of the maximum
likelihood estimators studied in the previous section. Here we prove these properties
only for the global model $\theta = (b, \alpha)$ when $b$ is known to be symmetric. The same technique
applies to all the other cases considered in this paper, where we have been able to obtain the
corresponding local asymptotic property.
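Let us consider the Wishart process $(X_t)_{t \ge 0} \in \mathcal{S}_d^+$ with parameters $\theta := (b, \alpha)$, with
$\alpha \ge d + 1$ and $b \in \mathcal{S}_d$:

\[ dX_t = [\alpha I_d + b X_t + X_t b]\,dt + \sqrt{X_t}\,dW_t + dW_t^\top \sqrt{X_t}, \quad t > 0, \qquad X_0 = x \in \mathcal{S}_d^{+,*}. \tag{47} \]

We recall that $\mathbb{P}_\theta$ denotes the distribution induced by the solutions of (47) on the canonical
space $C(\mathbb{R}_+, \mathcal{S}_d^+)$ with the natural filtration $\mathcal{F}_t := \sigma(X_s, s \le t)$, and $\mathbb{P}_{\theta,t} = \mathbb{P}_\theta|_{\mathcal{F}_t}$ denotes
the restriction of $\mathbb{P}_\theta$ to $\mathcal{F}_t$.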

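For $\tilde\alpha \ge d + 1$ and $\tilde b \in \mathcal{S}_d$, we set $\tilde\theta = (\tilde b, \tilde\alpha)$,

\[ H_t = \frac{\tilde\alpha - \alpha}{2} (\sqrt{X_t})^{-1} + (\tilde b - b) \sqrt{X_t}, \]

and we introduce the log-likelihood function

\[ \ell_T(\tilde\theta) = \log\left( \frac{d\mathbb{P}_{\tilde\theta,T}}{d\mathbb{P}_{\theta,T}} \right) = \int_0^T \mathrm{Tr}[H_s\,dW_s] - \frac{1}{2} \int_0^T \mathrm{Tr}[H_s H_s^\top]\,ds. \tag{48} \]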

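The process $(\tilde W_t = W_t - \int_0^t H_s^\top\,ds, t \le T)$ is a $d \times d$ Brownian motion under $\mathbb{P}_{\tilde\theta,T}$. In
the sequel, let us introduce the quantity $\delta_T := (\delta_{1,T}, \delta_{2,T}) \in \mathbb{R}^2$, where for $i \in \{1, 2\}$ the
localizing rates satisfy $|\delta_{i,T}| \to 0$ when $T \to \infty$. For all $u := (u_1, u_2) \in \mathbb{R} \times \mathcal{S}_d$, we define
$\delta_T \cdot u := (\delta_{1,T} u_1, \delta_{2,T} u_2) \in \mathbb{R} \times \mathcal{S}_d$. Now, we rewrite (48) with $\tilde\theta = \theta + \delta_T \cdot u$:

\[ \ell_T(\theta + \delta_T \cdot u) = \int_0^T \mathrm{Tr}\left[ \frac{\delta_{1,T} u_1}{2} (\sqrt{X_s})^{-1}\,dW_s + \delta_{2,T} u_2 \sqrt{X_s}\,dW_s \right] - \frac{1}{2} \int_0^T \mathrm{Tr}[\delta_{2,T} u_2 X_s \delta_{2,T} u_2]\,ds - \frac{T}{2} \delta_{1,T} u_1 \mathrm{Tr}[\delta_{2,T} u_2] - \frac{1}{8} (\delta_{1,T} u_1)^2 \int_0^T \mathrm{Tr}[(X_s)^{-1}]\,ds. \]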


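Hence, by using the definitions (13), (23) and (24) of the martingale processes $(N_t)_{t \ge 0}$ and
$(M_t)_{t \ge 0}$ and the processes $(R_t)_{t \ge 0}$ and $(Q_t)_{t \ge 0}$, it is easy to check that

\[ \ell_T(\theta + \delta_T \cdot u) = \frac{\delta_{1,T} u_1}{2} N_T + \frac{1}{2} \mathrm{Tr}[\delta_{2,T} u_2 M_T] - \frac{1}{2} \mathrm{Tr}[\delta_{2,T} u_2 R_T \delta_{2,T} u_2] - \frac{T}{2} \delta_{1,T} u_1 \mathrm{Tr}[\delta_{2,T} u_2] - \frac{1}{8} (\delta_{1,T} u_1)^2 (Q_T)^{-1} = \Lambda_T(u) - \frac{1}{2} \Gamma_T(u), \tag{49} \]

where $\Lambda_T(u) = \frac{1}{2} \left( \delta_{1,T} u_1 N_T + \mathrm{Tr}[\delta_{2,T} u_2 M_T] \right)$ is a linear random function with respect to
$u \in \mathbb{R} \times \mathcal{S}_d$ with quadratic variation

\[ \Gamma_T(u) = \delta_{2,T}^2 \mathrm{Tr}[u_2^2 R_T] + T \delta_{1,T} \delta_{2,T} u_1 \mathrm{Tr}[u_2] + \frac{1}{4} \delta_{1,T}^2 u_1^2 Q_T^{-1}. \]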
4.1 Case $-b \in \mathcal{S}_d^{+,*}$ and $\alpha \ge d + 1$

We first consider $\alpha > d + 1$. In this ergodic case, we set $\delta_{i,T} = T^{-1/2}$ for $i \in \{1, 2\}$, and we
get from (19) and (21)

\[ \Gamma_T(u) \to \Gamma_\infty(u) := \mathrm{Tr}[u_2^2 R_\infty] + u_1 \mathrm{Tr}[u_2] + \frac{u_1^2}{4} \mathrm{Tr}[(Q_\infty)^{-1}], \quad \text{as } T \to +\infty. \tag{50} \]

This yields the validity of the so-called Raykov type condition. Hence, according to Theorem 1
in [28], relations (49) and (50) ensure the validity of the local asymptotic normality (LAN)
property, that is, under $\mathbb{P}_\theta$ we have

\[ (\Lambda_T(u), \Gamma_T(u)) \Rightarrow \left( \Gamma_\infty^{1/2}(u) Z, \Gamma_\infty(u) \right), \quad \text{as } T \to \infty, \tag{51} \]

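with $Z$ a standard normal real random variable. It is worth noting that the above convergence
can also be obtained using the proof of Theorem 2.1. In fact, we have already proven that
under $\mathbb{P}_\theta$

\[ \left( \frac{N_T}{\sqrt{T}}, \frac{M_T}{\sqrt{T}} \right) \Rightarrow (G, H), \tag{52} \]

where $(G, H)$ is a centered Gaussian vector taking values in $\mathbb{R} \times \mathcal{S}_d$ such that

\[ \mathbb{E}_\theta(H_{i,j} H_{k,l}) = \delta_{jl} (R_\infty)_{i,k} + \delta_{jk} (R_\infty)_{i,l} + \delta_{il} (R_\infty)_{j,k} + \delta_{ik} (R_\infty)_{j,l}, \]

\[ \mathbb{E}_\theta(G H_{i,j}) = 2 \delta_{ij} \quad \text{and} \quad \mathbb{E}_\theta(G^2) = \mathrm{Tr}[(Q_\infty)^{-1}]. \]

Therefore, the LAN property (51) follows from relations (50) and (52).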

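We now consider the case $\alpha = d + 1$ and set $\delta_{1,T} = T^{-1}$ and $\delta_{2,T} = T^{-1/2}$. By using (33),
we get that under $\mathbb{P}_\theta$,

\[ (\Lambda_T(u), \Gamma_T(u)) \Rightarrow \left( -\frac{u_1}{2} \mathrm{Tr}[b] + \mathrm{Tr}\left[ u_2 \sqrt{R_\infty}\, \tilde G \right], \mathrm{Tr}[u_2^2 R_\infty] + \frac{u_1^2}{4} \tau_{-\mathrm{Tr}[b]} \right), \quad \text{as } T \to \infty, \]

where $\tau_{-\mathrm{Tr}[b]}$ is defined as in Theorem 2.2 and $\tilde G$ is an independent matrix, whose elements
$\tilde G_{i,j}$, $1 \le i, j \le d$, are independent standard normal variables. Hence, according to Le Cam
and Yang [25] and Jeganathan [22], this last convergence yields the LAQ property for this
ergodic case.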


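4.2 Case $b = 0$ and $\alpha \ge d + 1$

We first assume $\alpha > d + 1$. From (47) with $b = 0$ and (23), we have $M_T = X_T - x - \alpha I_d T$.
From (43), it follows that as $T \to \infty$

\[ \left( \frac{N_T}{\sqrt{\log(T)}}, \frac{M_T}{T}, \frac{R_T}{T^2}, \frac{Q_T^{-1}}{\log(T)} \right) \Rightarrow \left( \sqrt{\frac{d}{\alpha - (d+1)}}\, G, X_1^0 - \alpha I_d, R_1^0, \frac{d}{\alpha - (d+1)} \right), \]

where $X^0$ and $R^0$ are defined as in Theorem 3.1. Thus, in the same way as in the previous
case, if we set $\delta_{1,T} = \frac{1}{\sqrt{\log(T)}}$ and $\delta_{2,T} = \frac{1}{T}$, then $(\Lambda_T(u), \Gamma_T(u))$ converges in law under $\mathbb{P}_\theta$
to

\[ \left( \frac{u_1}{2} \sqrt{\frac{d}{\alpha - (d+1)}}\, G + \frac{1}{2} \mathrm{Tr}[u_2 (X_1^0 - \alpha I_d)], \mathrm{Tr}[u_2^2 R_1^0] + \frac{u_1^2 d}{4 (\alpha - (d+1))} \right), \quad \text{as } T \to \infty. \]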
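This ensures the validity of the LAQ property in this non-ergodic case.
When $\alpha = d + 1$, we use the notation of Theorem 3.2 and get from (44)

\[ \left( \frac{N_T}{\log(T)}, \frac{M_T}{T}, \frac{R_T}{T^2}, \frac{Q_T^{-1}}{\log(T)^2} \right) \Rightarrow \left( \frac{d}{2}, X_1^0 - \alpha I_d, R_1^0, \left( \frac{d}{2} \right)^2 \tau_1 \right). \]

With $\delta_{1,T} = \frac{1}{\log(T)}$ and $\delta_{2,T} = \frac{1}{T}$, we get that $(\Lambda_T(u), \Gamma_T(u))$ converges in law under $\mathbb{P}_\theta$ to

\[ \left( \frac{d}{4} u_1 + \frac{1}{2} \mathrm{Tr}[u_2 (X_1^0 - \alpha I_d)], \mathrm{Tr}[u_2^2 R_1^0] + \frac{d^2 u_1^2}{16} \tau_1 \right), \quad \text{as } T \to \infty. \]

This gives again the LAQ property.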


5 The Laplace transform and its use to study the MLE 

5.1 The Laplace transform of $(X_T, R_T)$

We present our main result on the joint Laplace transform of $(X_T, R_T)$, which can be of
independent interest. This Laplace transform is given by Bru [7], eq. (4.7), when $b = 0$ and
has been recently studied and obtained explicitly by Gnoatto and Grasselli [17]. Here, we
present another proof that enables us to get the Laplace transform for any $\alpha \ge d - 1$, as well
as a more precise result concerning its set of convergence, see Remarks 5.1 and 5.2 below for
a further discussion.


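Proposition 5.1. Let $\alpha \ge d - 1$, $x \in \mathcal{S}_d^+$, $b \in \mathcal{S}_d$ and $X \sim WIS_d(x, \alpha, b, I_d)$. Let $v, w \in \mathcal{S}_d$
be such that

\[ \exists m \in \mathcal{S}_d, \quad \frac{v}{2} - mb - bm - 2m^2 \in \mathcal{S}_d^+ \quad \text{and} \quad \frac{w}{2} + m \in \mathcal{S}_d^+. \tag{53} \]

Then, we have for $t \ge 0$

\[ \mathbb{E}\left[ \exp\left( -\frac{1}{2} \mathrm{Tr}[w X_t] - \frac{1}{2} \mathrm{Tr}[v R_t] \right) \right] = \frac{\exp\left( -\frac{\alpha}{2} \mathrm{Tr}[b] t \right)}{\det[V_{v,w}(t)]^{\frac{\alpha}{2}}} \exp\left( -\frac{1}{2} \mathrm{Tr}\left[ (V'_{v,w}(t) V_{v,w}(t)^{-1} + b) x \right] \right), \tag{54} \]

with

\[ V_{v,w}(t) = \left( \sum_{k=0}^{\infty} \frac{t^{2k+1} \tilde v^k}{(2k+1)!} \right) \tilde w + \sum_{k=0}^{\infty} \frac{t^{2k} \tilde v^k}{(2k)!}, \quad \tilde v = v + b^2, \quad \text{and} \quad \tilde w = w - b. \]

If besides $\tilde v = v + b^2 \in \mathcal{S}_d^{+,*}$, we have $V_{v,w}(t) = (\sqrt{\tilde v})^{-1} \sinh(\sqrt{\tilde v} t) \tilde w + \cosh(\sqrt{\tilde v} t)$ and then
$V'_{v,w}(t) = \cosh(\sqrt{\tilde v} t) \tilde w + \sinh(\sqrt{\tilde v} t) \sqrt{\tilde v}$.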
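To make formula (54) concrete, the sketch below evaluates its right-hand side numerically by truncating the two power series that define $V_{v,w}(t)$. The truncation level `n_terms` and the function names are assumptions made for illustration. The sketch does not check condition (53), which the caller is assumed to have verified.

```python
import numpy as np

def V_and_derivative(v_tilde, w_tilde, t, n_terms=60):
    """Truncated series for V_{v,w}(t) and its time derivative V'_{v,w}(t)."""
    d = v_tilde.shape[0]
    even = np.zeros((d, d))      # sum_k t^{2k} v_tilde^k / (2k)!
    odd = np.zeros((d, d))       # sum_k t^{2k+1} v_tilde^k / (2k+1)!
    term_even = np.eye(d)
    term_odd = t * np.eye(d)
    for k in range(n_terms):
        even += term_even
        odd += term_odd
        term_even = term_even @ v_tilde * t**2 / ((2 * k + 1) * (2 * k + 2))
        term_odd = term_odd @ v_tilde * t**2 / ((2 * k + 2) * (2 * k + 3))
    V = odd @ w_tilde + even
    dV = even @ w_tilde + v_tilde @ odd   # term-by-term derivative of V
    return V, dV

def laplace_transform(v, w, b, x, alpha, t):
    """Right-hand side of (54) for symmetric v, w, b and initial value x."""
    v_tilde = v + b @ b
    w_tilde = w - b
    V, dV = V_and_derivative(v_tilde, w_tilde, t)
    prefactor = np.exp(-0.5 * alpha * np.trace(b) * t) / np.linalg.det(V) ** (alpha / 2)
    return prefactor * np.exp(-0.5 * np.trace((dV @ np.linalg.inv(V) + b) @ x))
```

For $d = 1$, $b = 0$ and $v = 0$ the formula reduces to the classical Laplace transform $(1 + wt)^{-\alpha/2} \exp\left(-\frac{w x}{2(1 + wt)}\right)$ of a squared Bessel process, which gives a quick sanity check of the implementation.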

Before proving this result, we recall the following fact:

\[ \forall x, y \in \mathcal{S}_d^+, \quad \mathrm{Tr}[xy] \ge 0, \tag{55} \]

which is clear once we have observed that $\mathrm{Tr}[xy] = \mathrm{Tr}[\sqrt{x} y \sqrt{x}]$ and $\sqrt{x} y \sqrt{x} \in \mathcal{S}_d^+$. We also
recall a result on matrix Riccati equations, see Dieci and Eirola [11], Proposition 1.1.

Lemma 5.1. Let $\bar b \in \mathcal{S}_d$ and $\bar\delta \in \mathcal{S}_d^+$. Let $\xi$ denote the solution of the following matrix
Riccati differential equation

\[ \xi' + 2 \xi^2 = \bar b \xi + \xi \bar b + \bar\delta, \quad \xi(0) \in \mathcal{S}_d. \tag{56} \]

If $\xi(0) \in \mathcal{S}_d^+$, the solution $\xi$ is well defined for any $t \ge 0$ and satisfies $\xi(t) \in \mathcal{S}_d^+$.

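Proof of Proposition 5.1. Let $T > 0$ be given. We first assume $w, v \in \mathcal{S}_d^+$, which ensures that
$\mathbb{E}\left[ \exp\left( -\frac{1}{2} \mathrm{Tr}[w X_T + v R_T] \right) \right] < \infty$. We consider the martingale

\[ M_t = \mathbb{E}\left[ \exp\left( -\frac{1}{2} \mathrm{Tr}[w X_T + v R_T] \right) \Big| \mathcal{F}_t \right], \quad t \in [0, T]. \]

Due to the affine structure, we are looking for smooth functions $\beta : \mathbb{R}_+ \to \mathbb{R}$, $\gamma, \delta : \mathbb{R}_+ \to \mathcal{S}_d$
such that

\[ M_t = \exp\left( \beta(T - t) + \mathrm{Tr}[\gamma(T - t) X_t] + \mathrm{Tr}[\delta(T - t) R_t] \right). \]

We necessarily have $\beta(0) = 0$, $\gamma(0) = -w/2$ and $\delta(0) = -v/2$. Itô's formula gives

\[ \frac{dM_t}{M_t} = \left\{ -\beta'(T - t) - \mathrm{Tr}[\gamma'(T - t) X_t] - \mathrm{Tr}[\delta'(T - t) R_t] + \mathrm{Tr}[\gamma(T - t)(\alpha I_d + b X_t + X_t b)] + \mathrm{Tr}[\delta(T - t) X_t] + 2 \mathrm{Tr}[\gamma(T - t)^2 X_t] \right\} dt + \mathrm{Tr}[\gamma(T - t)(\sqrt{X_t}\,dW_t + dW_t^\top \sqrt{X_t})]. \]

Since $M$ is a martingale, the drift term should vanish almost surely. The drift term being
a (deterministic) affine function of $(X_t, R_t)$, we obtain the following system of differential
equations:

\[ \delta' = 0, \tag{57} \]
\[ -\gamma' + b\gamma + \gamma b + 2\gamma^2 + \delta = 0, \tag{58} \]
\[ -\beta' + \alpha \mathrm{Tr}[\gamma] = 0. \tag{59} \]

The first equation gives $\delta(t) = -v/2$. The second equation is a matrix Riccati differential
equation. We now consider $\xi = m - \gamma$ with $m$ satisfying (53). It solves (56) with $\bar b = b + 2m$,
$\bar\delta = -bm - mb - 2m^2 + v/2$ and $\xi(0) = m + w/2$. We know then by Lemma 5.1 that $\xi$ is
well defined for any $t \ge 0$ and stays in $\mathcal{S}_d^+$. In particular, $\gamma$ is well defined for any $t \ge 0$.

We set $\tilde\gamma = \gamma + \frac{b}{2}$. We have $\gamma^2 = \tilde\gamma^2 - \frac{1}{2}(b\tilde\gamma + \tilde\gamma b) + \frac{b^2}{4}$ and thus $\tilde\gamma$ solves the following
matrix Riccati differential equation:

\[ \tilde\gamma' = 2 \tilde\gamma^2 - \frac{1}{2} \tilde v, \quad \tilde\gamma(0) = -\frac{1}{2} \tilde w, \quad \tilde v = v + b^2 \quad \text{and} \quad \tilde w = w - b. \]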


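We set

\[ M(t) = \begin{pmatrix} M_1(t) & M_2(t) \\ M_3(t) & M_4(t) \end{pmatrix} = \exp\left( t \begin{pmatrix} 0 & -\tilde v / 2 \\ -2 I_d & 0 \end{pmatrix} \right) \in \mathcal{M}_{2d} \]

and get by Levin [26] that

\[ \tilde\gamma(t) = \left( M_2(t) - \frac{1}{2} M_1(t) \tilde w \right) \left( M_4(t) - \frac{1}{2} M_3(t) \tilde w \right)^{-1}. \]

We check that the matrix $M_4(t) - \frac{1}{2} M_3(t) \tilde w$ is indeed invertible. In fact, let

\[ \tau = \inf\left\{ t \ge 0, \det\left[ M_4(t) - \frac{1}{2} M_3(t) \tilde w \right] = 0 \right\}. \]

We have $\tau > 0$ and for $t \in [0, \tau)$, $\frac{d}{dt}\left[ M_4(t) - \frac{1}{2} M_3(t) \tilde w \right] = -2 \left[ M_2(t) - \frac{1}{2} M_1(t) \tilde w \right]$, and thus

\[ \frac{d}{dt} \det\left[ M_4(t) - \frac{1}{2} M_3(t) \tilde w \right] = -2 \det\left[ M_4(t) - \frac{1}{2} M_3(t) \tilde w \right] \mathrm{Tr}[\tilde\gamma(t)]. \]

This gives $\det\left[ M_4(t) - \frac{1}{2} M_3(t) \tilde w \right] = \exp\left( -2 \int_0^t \mathrm{Tr}[\tilde\gamma(s)]\,ds \right) > 0$, and we necessarily get $\tau = +\infty$
since $\gamma$ and thus $\tilde\gamma$ is well defined for $t \ge 0$.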

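Since

\[ \begin{pmatrix} 0 & -\tilde v / 2 \\ -2 I_d & 0 \end{pmatrix}^{2k} = \begin{pmatrix} \tilde v^k & 0 \\ 0 & \tilde v^k \end{pmatrix} \quad \text{and} \quad \begin{pmatrix} 0 & -\tilde v / 2 \\ -2 I_d & 0 \end{pmatrix}^{2k+1} = \begin{pmatrix} 0 & -\tilde v^{k+1} / 2 \\ -2 \tilde v^k & 0 \end{pmatrix}, \]

we get

\[ M_1(t) = M_4(t) = \sum_{k=0}^{\infty} \frac{t^{2k} \tilde v^k}{(2k)!}, \quad M_2(t) = -\frac{1}{2} \sum_{k=0}^{\infty} \frac{t^{2k+1} \tilde v^{k+1}}{(2k+1)!}, \quad M_3(t) = -2 \sum_{k=0}^{\infty} \frac{t^{2k+1} \tilde v^k}{(2k+1)!}. \]

If $\tilde v = v + b^2 \in \mathcal{S}_d^{+,*}$, $\sqrt{\tilde v}$ is well defined and we have $M_1(t) = M_4(t) = \cosh(t \sqrt{\tilde v})$,
$M_2(t) = -\frac{1}{2} \sqrt{\tilde v} \sinh(t \sqrt{\tilde v})$ and $M_3(t) = -2 (\sqrt{\tilde v})^{-1} \sinh(t \sqrt{\tilde v})$. Now, we define

\[ V(t) = M_4(t) - \frac{1}{2} M_3(t) \tilde w = \left( \sum_{k=0}^{\infty} \frac{t^{2k+1} \tilde v^k}{(2k+1)!} \right) \tilde w + \sum_{k=0}^{\infty} \frac{t^{2k} \tilde v^k}{(2k)!}. \]

Since $V'(t) = -2 M_2(t) + M_4(t) \tilde w$, we obtain that

\[ \tilde\gamma(t) = -\frac{1}{2} V'(t) V(t)^{-1} \quad \text{and thus} \quad \gamma(t) = -\frac{1}{2} \left( V'(t) V(t)^{-1} + b \right). \]

Last, we have $\beta' = -\frac{1}{2} \alpha \mathrm{Tr}[V'(t) V(t)^{-1}] - \frac{1}{2} \alpha \mathrm{Tr}[b]$ and we obtain that

\[ \beta(t) = -\frac{1}{2} \alpha \log(\det[V(t)]) - \frac{1}{2} \alpha \mathrm{Tr}[b] t, \]

since $\frac{d \det[V(t)]}{dt} = \det[V(t)] \mathrm{Tr}[V'(t) V(t)^{-1}]$.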

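It remains to show that we indeed have (54) for $v$ and $w$ satisfying (53). We define

\[ \mathcal{E}_t = \frac{\exp\left( \beta(T - t) + \mathrm{Tr}[\gamma(T - t) X_t] + \mathrm{Tr}\left[ -\frac{v}{2} R_t \right] \right)}{\exp\left( \beta(T) + \mathrm{Tr}[\gamma(T) x] \right)}. \]

By Itô's formula, we have

\[ \frac{d\mathcal{E}_t}{\mathcal{E}_t} = \mathrm{Tr}[\gamma(T - t)(\sqrt{X_t}\,dW_t + dW_t^\top \sqrt{X_t})]. \]

This is a positive local martingale and thus a supermartingale, which gives $\mathbb{E}[\mathcal{E}_T] \le 1$, and we
want to prove that this is a martingale. To do so, we use the argument presented by Rydberg
in [34]. For $L > 0$, we define

\[ \tau_L = \inf\{t \ge 0, \mathrm{Tr}[X_t] \ge L\}, \]

and $\pi_L(x) = x \mathbf{1}_{\mathrm{Tr}[x] \le L} + \frac{L}{\mathrm{Tr}[x]} x \mathbf{1}_{\mathrm{Tr}[x] > L}$ for $x \in \mathcal{S}_d^+$. We consider $(\mathcal{E}_t^L, t \in [0, T])$ the solution
of

\[ d\mathcal{E}_t^L = \mathcal{E}_t^L \mathrm{Tr}[\gamma(T - t)(\sqrt{\pi_L(X_t)}\,dW_t + dW_t^\top \sqrt{\pi_L(X_t)})], \quad \mathcal{E}_0^L = 1. \]

We clearly have $\mathbb{E}[\mathcal{E}_T^L] = 1$. Besides, under $\mathbb{P}^L$ given by $\frac{d\mathbb{P}^L}{d\mathbb{P}} = \mathcal{E}_T^L$, the process

\[ dW_t^L = dW_t - 2 \sqrt{\pi_L(X_t)}\, \gamma(T - t)\,dt, \quad t \in [0, T], \]

is a matrix Brownian motion. Since $\mathcal{E}_t = \mathcal{E}_t^L$ for $t \le \tau_L$, we have $\mathbb{E}[\mathcal{E}_T] = \mathbb{E}[\mathcal{E}_T^L \mathbf{1}_{\tau_L > T}] + \mathbb{E}[\mathcal{E}_T \mathbf{1}_{\tau_L \le T}]$.
By Lebesgue's theorem, we get $\mathbb{E}[\mathcal{E}_T \mathbf{1}_{\tau_L \le T}] \to 0$ as $L \to +\infty$. On the other hand, $\mathbb{E}[\mathcal{E}_T^L \mathbf{1}_{\tau_L > T}] = \mathbb{P}^L(\tau_L > T)$.

Let us consider the Wishart process $\tilde X$ starting from $x$ such that

\[ d\tilde X_t = \left[ \alpha I_d + (b + 2\gamma(T - t)) \tilde X_t + \tilde X_t (b + 2\gamma(T - t)) \right] dt + \sqrt{\tilde X_t}\,dW_t + dW_t^\top \sqrt{\tilde X_t}. \]

We also define $\tilde\tau_L = \inf\{t \in [0, T], \mathrm{Tr}[\tilde X_t] \ge L\}$ with the convention $\inf \emptyset = +\infty$. The process $\tilde X$
solves the same SDE on $[0, \tilde\tau_L \wedge T]$ under $\mathbb{P}$ as $X$ on $[0, \tau_L \wedge T]$ under $\mathbb{P}^L$. We therefore have

\[ \mathbb{P}^L(\tau_L > T) = \mathbb{P}(\tilde\tau_L > T) \underset{L \to +\infty}{\longrightarrow} 1, \]

which finally gives $\mathbb{E}[\mathcal{E}_T] = 1$. □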

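Corollary 5.1. Let $Y \sim WIS_d(y, \alpha, b, a)$ be a Wishart process with parameters $\alpha \ge d - 1$,
$y \in \mathcal{S}_d^+$, $a, b \in \mathcal{M}_d$ satisfying

\[ b a^\top a = a^\top a b^\top \quad \text{and } a \text{ invertible}. \tag{60} \]

Let $v, w \in \mathcal{S}_d$ be such that

\[ \exists m \in \mathcal{S}_d, \quad \frac{1}{2} a w a^\top + m \in \mathcal{S}_d^+ \quad \text{and} \quad \frac{1}{2} a v a^\top - a b^\top a^{-1} m - m (a^\top)^{-1} b a^\top - 2 m^2 \in \mathcal{S}_d^+. \tag{61} \]

Then, we have

\[ \mathbb{E}\left[ \exp\left( -\frac{1}{2} \mathrm{Tr}\left[ w Y_t + v \int_0^t Y_s\,ds \right] \right) \right] = \frac{\exp\left( -\frac{\alpha}{2} \mathrm{Tr}[b] t \right)}{\det[V_{v,w}(t)]^{\frac{\alpha}{2}}} \exp\left( -\frac{1}{2} \mathrm{Tr}\left[ (V'_{v,w}(t) V_{v,w}(t)^{-1} + (a^\top)^{-1} b a^\top)(a^\top)^{-1} y a^{-1} \right] \right), \]

with $V_{v,w}(t) = \left( \sum_{k=0}^{\infty} \frac{t^{2k+1} \tilde v^k}{(2k+1)!} \right) \tilde w + \sum_{k=0}^{\infty} \frac{t^{2k} \tilde v^k}{(2k)!}$,
$\tilde v = a v a^\top + (a^\top)^{-1} b^2 a^\top$, and $\tilde w = a w a^\top - (a^\top)^{-1} b a^\top$.

Proof. We know that $Y = a^\top X a$ in law, with $x = (a^\top)^{-1} y a^{-1}$ and $X \sim WIS_d(x, \alpha, (a^\top)^{-1} b a^\top, I_d)$,
see e.g. equation (13) in [1]. We notice that $(a^\top)^{-1} b a^\top = a b^\top a^{-1} \iff b a^\top a = a^\top a b^\top$ and
thus $(a^\top)^{-1} b a^\top \in \mathcal{S}_d$. We have

\[ \mathbb{E}\left[ \exp\left( -\frac{1}{2} \mathrm{Tr}\left[ w Y_t + v \int_0^t Y_s\,ds \right] \right) \right] = \mathbb{E}\left[ \exp\left( -\frac{1}{2} \mathrm{Tr}\left[ a w a^\top X_t + a v a^\top \int_0^t X_s\,ds \right] \right) \right], \]

which gives the result by applying Proposition 5.1. □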
□ 
By setting $\tilde m = a^{-1} m (a^\top)^{-1}$, the condition (61) is equivalent to the existence of $\tilde m \in \mathcal{S}_d$
such that

\[ \frac{1}{2} w + \tilde m \in \mathcal{S}_d^+ \quad \text{and} \quad \frac{v}{2} - b^\top \tilde m - \tilde m b - 2 \tilde m a^\top a \tilde m \in \mathcal{S}_d^+. \tag{62} \]

The case $\tilde m = 0$ gives back the finiteness of the Laplace transform when $v, w \in \mathcal{S}_d^+$. If we
take $\tilde m = -w/2$, we get also the finiteness when

\[ v + b^\top w + w b - w a^\top a w \in \mathcal{S}_d^+. \tag{63} \]

Another interesting choice is $m = -\frac{1}{2} (a^\top)^{-1} b a^\top$. We have $m \in \mathcal{S}_d$ from (60). This choice
gives the finiteness of the Laplace transform when $v + b^\top (a^\top a)^{-1} b \in \mathcal{S}_d^+$ and $w - (a^\top a)^{-1} b \in \mathcal{S}_d^+$.
Let us note that $\tilde v = a (v + b^\top (a^\top a)^{-1} b) a^\top$, so that the first condition is the same as $\tilde v \in \mathcal{S}_d^+$.
Another interesting choice of $m$ is given by the next remark.

Remark 5.1. Proposition 5.1 extends the result of Gnoatto and Grasselli [17] to $\alpha \ge d - 1$,
and the sufficient condition (61) that ensures the finiteness of the Laplace transform is also
less restrictive, which is crucial in our study, especially in the nonergodic case. In particular,
it does not assume a priori that $v + b^\top (a^\top a)^{-1} b \in \mathcal{S}_d^+$. We can recover the result of [17] as follows.
Let us assume $v + b^\top (a^\top a)^{-1} b \in \mathcal{S}_d^+$ and take $m = -\frac{1}{2} (a^\top)^{-1} b a^\top + \frac{1}{2} \sqrt{a (v + b^\top (a^\top a)^{-1} b) a^\top}$.
We have $m \in \mathcal{S}_d$ from (60) and it satisfies $\frac{1}{2} a v a^\top - a b^\top a^{-1} m - m (a^\top)^{-1} b a^\top - 2 m^2 = 0 \in \mathcal{S}_d^+$.
Therefore, (61) holds if

\[ w - (a^\top a)^{-1} b + a^{-1} \sqrt{a (v + b^\top (a^\top a)^{-1} b) a^\top}\, (a^\top)^{-1} \in \mathcal{S}_d^+. \]

This is precisely the condition stated in [17].

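Remark 5.2. It is possible to get similarly the Laplace transform of $\left( Y_t, \int_0^t Y_s\,ds \right)$ when $Y$
solves

\[ dY_t = [\bar\alpha + b Y_t + Y_t b^\top]\,dt + \sqrt{Y_t}\,dW_t a + a^\top dW_t^\top \sqrt{Y_t}, \quad Y_0 = y \in \mathcal{S}_d^+, \]

with $a, b$ satisfying (60) and $\bar\alpha - (d - 1) a^\top a \in \mathcal{S}_d^+$. Again, equation (13) in [1] gives
$Y = a^\top X a$ in law, where

\[ dX_t = [\tilde\alpha + \tilde b X_t + X_t \tilde b]\,dt + \sqrt{X_t}\,dW_t + dW_t^\top \sqrt{X_t}, \quad X_0 = x, \]

with $x = (a^\top)^{-1} y a^{-1} \in \mathcal{S}_d^+$, $\tilde b = (a^\top)^{-1} b a^\top \in \mathcal{S}_d$ and $\tilde\alpha = (a^\top)^{-1} \bar\alpha a^{-1} \in \mathcal{S}_d^+$. Repeating the
proof of Proposition 5.1, we observe that the Riccati equation (58) and equation (57) remain
unchanged while (59) is replaced by

\[ \beta' = \mathrm{Tr}[\tilde\alpha \gamma] = -\frac{1}{2} \mathrm{Tr}[\tilde\alpha V'(t) V(t)^{-1}] - \frac{1}{2} \mathrm{Tr}[\tilde\alpha \tilde b]. \]

Therefore, we deduce that under the same condition (61), we have

\[ \mathbb{E}\left[ \exp\left( -\frac{1}{2} \mathrm{Tr}\left[ w Y_t + v \int_0^t Y_s\,ds \right] \right) \right] = \exp(\beta(t)) \exp\left( -\frac{1}{2} \mathrm{Tr}\left[ (V'_{v,w}(t) V_{v,w}(t)^{-1} + (a^\top)^{-1} b a^\top)(a^\top)^{-1} y a^{-1} \right] \right), \]

with $\beta(t) = -\frac{1}{2} \int_0^t \mathrm{Tr}[(a^\top)^{-1} \bar\alpha a^{-1} V'_{v,w}(s) V_{v,w}(s)^{-1}]\,ds - \frac{t}{2} \mathrm{Tr}[\bar\alpha (a^\top a)^{-1} b]$ and $V_{v,w}(t)$ defined
as in Corollary 5.1. Thus, the formula is no longer totally explicit. In Gnoatto and Grasselli [17],
the result is stated with $\mathrm{Tr}[(a^\top)^{-1} \bar\alpha a^{-1} \log(V_{v,w}(t))]$ instead of the first integral.
However, this replacement does not seem clear to us unless $V'_{v,w}(s)$ and $V_{v,w}(s)$ commute for
all $s \ge 0$ (this happens when the matrices $\tilde v$ and $\tilde w$ in $V_{v,w}$ commute) or $\bar\alpha = \alpha a^\top a$, by using
the cyclic property of the trace.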

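Corollary 5.2. Let $Y \sim WIS_d(y, \alpha, b, a)$ be a Wishart process with parameters such that
$b a^\top a = a^\top a b^\top$ and $a$ invertible. Then,

\[ \forall u \in \mathcal{S}_d, \quad \mathbb{E}\left[ \exp\left( \int_0^T \mathrm{Tr}[u \sqrt{Y_s}\,dW_s a] - \frac{1}{2} \int_0^T \mathrm{Tr}[a u Y_s u a^\top]\,ds \right) \right] = 1. \]

Proof. We have $2 \int_0^T \mathrm{Tr}[u \sqrt{Y_s}\,dW_s a] = \mathrm{Tr}[u (Y_T - y)] - \alpha T \mathrm{Tr}[u a^\top a] - \mathrm{Tr}\left[ (u b + b^\top u) \int_0^T Y_s\,ds \right]$.
We apply Corollary 5.1 with $w = -u$ and $v = u b + b^\top u + u a^\top a u$. Therefore, (63) holds.
We then have $\tilde w = -(a u a^\top + (a^\top)^{-1} b a^\top)$ and $\tilde v = \tilde w^2$, and the result follows by simple
calculations. □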


5.2 Study of the MLE of b with the Laplace transform 

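We consider $\epsilon : \mathbb{R}_+ \to \mathbb{R}_+^*$ a (deterministic) decreasing function such that $\lim_{t \to +\infty} \epsilon_t = 0$.
From the definition of the MLE of $b$ (18), we get that

\[ \frac{1}{\epsilon_T} (\hat b_T - b) = \mathcal{L}^{-1}_{\epsilon_T^2 R_T}\left( \epsilon_T [X_T - x - \alpha T I_d - b R_T - R_T b] \right). \]

Thus, we want to calculate the Laplace transform of $\left( \epsilon_T [X_T - x - \alpha T I_d - b R_T - R_T b], \epsilon_T^2 R_T \right)$
in order to study the convergence of $\frac{1}{\epsilon_T} (\hat b_T - b)$. For $\lambda_1, \lambda_2 \in \mathcal{S}_d$, we define

\[ \mathcal{E}(T, \lambda_1, \lambda_2) := \mathbb{E}_\theta\left[ \exp\left( -\epsilon_T \mathrm{Tr}[\lambda_2 (X_T - x - \alpha T I_d - b R_T - R_T b)] - \epsilon_T^2 \mathrm{Tr}[\lambda_1 R_T] \right) \right] \tag{64} \]
\[ = \exp\left( \epsilon_T \mathrm{Tr}[\lambda_2 (x + \alpha T I_d)] \right) \mathbb{E}_\theta\left[ \exp\left( -\mathrm{Tr}[\epsilon_T \lambda_2 X_T] - \mathrm{Tr}[(\epsilon_T^2 \lambda_1 - \epsilon_T (\lambda_2 b + b \lambda_2)) R_T] \right) \right]. \tag{65} \]

We now consider $\lambda_1, \lambda_2 \in \mathcal{S}_d$ such that

\[ \lambda_1 - 2 \lambda_2^2 \in \mathcal{S}_d^{+,*}. \tag{66} \]

We define

\[ v_T = 2 \lambda_1 \epsilon_T^2 - 2 (b \lambda_2 + \lambda_2 b) \epsilon_T, \quad \tilde v_T = v_T + b^2, \quad w_T = 2 \lambda_2 \epsilon_T, \quad \tilde w_T = w_T - b, \tag{67} \]

and have $v_T + b w_T + w_T b - w_T^2 = \epsilon_T^2 (2 \lambda_1 - 4 \lambda_2^2) \in \mathcal{S}_d^{+,*}$. Thus, by applying Proposition 5.1
with $m = -\epsilon_T \lambda_2$, we get that $\mathcal{E}(T, \lambda_1, \lambda_2)$ is finite and given by

\[ \mathcal{E}(T, \lambda_1, \lambda_2) = \frac{\exp\left( -\frac{\alpha}{2} \mathrm{Tr}[b] T \right)}{\det[V_{v_T, w_T}(T)]^{\frac{\alpha}{2}}} \exp\left( -\frac{1}{2} \mathrm{Tr}\left[ (V'_{v_T, w_T}(T) V_{v_T, w_T}(T)^{-1} + b) x \right] \right) \times \exp\left( \epsilon_T \mathrm{Tr}[\lambda_2 (x + \alpha T I_d)] \right) \tag{68} \]

with

\[ V_{v_T, w_T}(T) = (\sqrt{\tilde v_T})^{-1} \sinh(\sqrt{\tilde v_T} T) \tilde w_T + \cosh(\sqrt{\tilde v_T} T), \quad V'_{v_T, w_T}(T) = \cosh(\sqrt{\tilde v_T} T) \tilde w_T + \sinh(\sqrt{\tilde v_T} T) \sqrt{\tilde v_T}. \]

Besides, we have $\tilde v_T = (b - 2 \epsilon_T \lambda_2)^2 + \epsilon_T^2 (2 \lambda_1 - 4 \lambda_2^2) \in \mathcal{S}_d^{+,*}$.

When $-b \in \mathcal{S}_d^{+,*}$ and $\epsilon_T = 1/\sqrt{T}$, we can make explicit calculations and get

\[ \lim_{T \to +\infty} \mathcal{E}(T, \lambda_1, \lambda_2) = \exp\left( -\mathrm{Tr}[\lambda_1 R_\infty] + \mathrm{Tr}[2 \lambda_2^2 R_\infty] \right), \]

which gives another way to prove Theorem 2.4. Here, we prove Theorem 3.4.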

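Proof of Theorem 3.4. Here, we focus on the case $b = b_0 I_d$ with $b_0 > 0$ and set $\epsilon_T = e^{-b_0 T}$.
Since the square root function is analytic on the set of positive definite matrices (see e.g.
[33], p. 134) we get that

\[ \sqrt{\tilde v_T} = b_0 I_d - 2 \epsilon_T \lambda_2 + \frac{\epsilon_T^2}{b_0} (\lambda_1 - 2 \lambda_2^2) + O(\epsilon_T^3), \]

since the squares of each side coincide up to an $O(\epsilon_T^3)$ term. We observe that $\tilde w_T = 2 \epsilon_T \lambda_2 - b_0 I_d$,
and thus $\sqrt{\tilde v_T} + \tilde w_T = \frac{\epsilon_T^2}{b_0} (\lambda_1 - 2 \lambda_2^2) + O(\epsilon_T^3)$.

We now write

\[ V_{v_T, w_T}(T) = (\sqrt{\tilde v_T})^{-1} \left[ \frac{1}{2} \exp(\sqrt{\tilde v_T} T)(\sqrt{\tilde v_T} + \tilde w_T) + \frac{1}{2} \exp(-\sqrt{\tilde v_T} T)(\sqrt{\tilde v_T} - \tilde w_T) \right], \]
\[ V'_{v_T, w_T}(T) = \frac{1}{2} \exp(\sqrt{\tilde v_T} T)(\sqrt{\tilde v_T} + \tilde w_T) + \frac{1}{2} \exp(-\sqrt{\tilde v_T} T)(\tilde w_T - \sqrt{\tilde v_T}). \]

Since $\epsilon_T \exp(\sqrt{\tilde v_T} T) \to I_d$ as $T \to +\infty$, we get $\frac{1}{\epsilon_T} V'_{v_T, w_T}(T) \to \frac{1}{2 b_0} (\lambda_1 - 2 \lambda_2^2) - b_0 I_d$. This yields

\[ \frac{1}{\epsilon_T} \sqrt{\tilde v_T}\, V_{v_T, w_T}(T) \underset{T \to +\infty}{\longrightarrow} \frac{1}{2 b_0} (\lambda_1 - 2 \lambda_2^2) + b_0 I_d \]

and

\[ V'_{v_T, w_T}(T) V_{v_T, w_T}(T)^{-1} + b_0 I_d \underset{T \to +\infty}{\longrightarrow} (\lambda_1 - 2 \lambda_2^2) \left( \frac{1}{2 b_0} (\lambda_1 - 2 \lambda_2^2) + b_0 I_d \right)^{-1}. \]

We also have $\frac{\exp\left( -\frac{\alpha}{2} \mathrm{Tr}[b_0 I_d] T \right)}{\det[V_{v_T, w_T}(T)]^{\frac{\alpha}{2}}} = \frac{1}{\det\left[ \frac{1}{\epsilon_T} V_{v_T, w_T}(T) \right]^{\frac{\alpha}{2}}} \underset{T \to +\infty}{\longrightarrow} \frac{1}{\det\left[ \frac{1}{2 b_0^2} (\lambda_1 - 2 \lambda_2^2) + I_d \right]^{\frac{\alpha}{2}}}$, and therefore

\[ \lim_{T \to +\infty} \mathcal{E}(T, \lambda_1, \lambda_2) = \frac{\exp\left( -\frac{1}{2 b_0} \mathrm{Tr}\left[ (\lambda_1 - 2 \lambda_2^2) \left( \frac{1}{2 b_0^2} (\lambda_1 - 2 \lambda_2^2) + I_d \right)^{-1} x \right] \right)}{\det\left[ \frac{1}{2 b_0^2} (\lambda_1 - 2 \lambda_2^2) + I_d \right]^{\frac{\alpha}{2}}}. \tag{69} \]

We now want to identify the limit. We know that $X \sim WIS_d\left( \frac{x}{2 b_0}, \alpha, 0, I_d; \frac{1}{4 b_0^2} \right)$ has the
following Laplace transform

\[ u \in \mathcal{S}_d^+, \quad \mathbb{E}[\exp(-\mathrm{Tr}[u X])] = \frac{\exp\left( -\frac{1}{2 b_0} \mathrm{Tr}\left[ u \left( I_d + \frac{1}{2 b_0^2} u \right)^{-1} x \right] \right)}{\det\left[ I_d + \frac{1}{2 b_0^2} u \right]^{\frac{\alpha}{2}}}. \]

Let $\tilde G$ denote a $d$-square matrix independent from $X$, whose entries are independent and
follow a standard Normal distribution. By Lemma B.4, we have

\[ \mathbb{E}\left[ \exp\left( -\mathrm{Tr}[\lambda_1 X + \lambda_2 (\sqrt{X} \tilde G + \tilde G^\top \sqrt{X})] \right) \right] = \mathbb{E}\left[ \exp\left( -\mathrm{Tr}[(\lambda_1 - 2 \lambda_2^2) X] \right) \right]. \]

Thus, (69) shows the convergence in law of $\left( \epsilon_T (X_T - x - \alpha T I_d - b R_T - R_T b), \epsilon_T^2 R_T \right)$ to
$\left( \sqrt{X} \tilde G + \tilde G^\top \sqrt{X}, X \right)$ under $\mathbb{P}_\theta$, which gives the claim of Theorem 3.4. □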

6 Numerical Study 

In this section, we test the convergence of the MLE given by (15) and (18). To do so, we
consider a given large value of $T$ and simulate the Wishart process exactly on the regular time
grid $t_i = \frac{iT}{N}$, $i = 0, \dots, N$. This can be done by using the method presented in Ahdida and
Alfonsi [1], see also Alfonsi [3]. We take $N$ sufficiently large and approximate the integrals $R_T$
and $Q_T^{-1}$ by applying the trapezoidal rule along this time grid. Thus, we will use the estimator
with the exact value of $X_T$ and these approximated values of $R_T$ and $Q_T^{-1}$.
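The trapezoidal approximation described above can be sketched as follows. This is only an illustration: it assumes the sampled path is stored as an array of shape `(N + 1, d, d)` on the regular grid, and that $Q_T^{-1}$ denotes $\int_0^T \mathrm{Tr}[X_s^{-1}]\,ds$. The function name `trapezoid_integrals` is hypothetical, and the exact simulation of the path itself is not covered.

```python
import numpy as np

def trapezoid_integrals(path, T):
    """Trapezoidal approximations of R_T = int_0^T X_s ds and Q_T^{-1} = int_0^T Tr[X_s^{-1}] ds.

    path: array of shape (N + 1, d, d) holding X at times i * T / N.
    """
    N = path.shape[0] - 1
    h = T / N
    weights = np.full(N + 1, h)
    weights[0] = weights[-1] = h / 2.0            # half weight at both end points
    R_T = np.tensordot(weights, path, axes=1)     # weighted sum of the matrices
    inv_traces = np.trace(np.linalg.inv(path), axis1=1, axis2=2)
    Q_T_inv = weights @ inv_traces
    return R_T, Q_T_inv
```

The inverse in the second integral requires every sampled matrix to be invertible, which is why this quantity is only meaningful when the path stays in the set of positive definite matrices.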

This section has three goals. First, we check numerically the convergence results that
we have obtained. Second, we investigate numerically the convergence of the MLE in some
nonergodic cases, where no theoretical convergence result has been found. Last, we test
the estimation of the parameters of a full Wishart process (1). To do so, we first estimate $a^\top a$
with the quadratic variation and then the parameters $\alpha$ and $b$ by using the MLE (15) on the
process $(a^\top)^{-1} X a^{-1}$.

6.1 Numerical validation of the convergence results 

Using the method mentioned above, we have checked the convergence results obtained in
this paper. Namely, we sample $M = 10000$ independent paths of $X$ in order to draw a
histogram of the properly rescaled value of $(\hat b_T)_{i,j} - b_{i,j}$ or $\hat\alpha_T - \alpha$. We do not reproduce all these
graphics here, and present as an example in Figure 1 an illustration of the convergence given
by Theorem 3.4.




(a) Limit law of exp(0.05T)(5 — 6 ^) 1 ,i- (b) Limit law of exp(0.05r)(6 — bT)i, 2 - 


Figure 1: Asymptotic law of the error for the estimation of 0 = b with for: x = (q^ qJ), 
T = 100, N = 10000, a = 4.5 and b = 0.05/^. 
6.2 Experimental convergence in a nonergodic case

In this paragraph, we try to guess the asymptotic behavior of the MLE in a nonergodic
case, where no theoretical convergence result is known. Namely, we observe in Figure 2
the asymptotic estimation error, when $b = \mathrm{diag}(0.1, 0.005)$ is diagonal with positive and
distinct terms on its diagonal and when we use the estimator (18). As one might have
guessed, the convergence of each diagonal term seems to occur at an exponential rate, with the
exponential speed corresponding to its value. Namely, $(\hat b_T)_{1,1}$ seems to converge to $b_{1,1}$ with a
speed of $\exp(0.1T)$ while $(\hat b_T)_{2,2}$ seems to converge to $b_{2,2}$ with a speed of $\exp(0.005T)$. More
interesting is the antidiagonal term. One could have imagined that the convergence rate is
the slowest of these two rates. Instead, in our experiment, the convergence of $(\hat b_T)_{1,2}$ towards
$b_{1,2}$ seems to happen with the rate $\exp(0.1T)$. We have observed the same behaviour for
other parameter values. Of course, it would be hasty to draw a global conclusion from a few
particular experiments. However, it is interesting to note that these numerical tests are a
way to guess or check the convergence rate of the MLE.



(a) Limit law of exp(0.1T)(6 — St)i,i. (b) Limit law of exp(0.005T)(6 — S'r) 2 , 2 - 



(c) Limit law of exp(0.ir)(6 — St)i,2- 

Figure 2: Asymptotic law of the error for the estimation ol 6 = b with x =( g 2 ), T = 100, 
N = 10000, a = 3.5 and b = diag(0.1, 0.005). 

6.3 Estimation of the whole Wishart process 

In this last part of the numerical study, we perform the estimation of all the parameters of the
Wishart process (1). We consider a case where $a$ is upper triangular and $(a^\top)^{-1} b a^\top$ is symmetric.
We proceed as follows. First, we sample exactly a discrete path $(X_{iT/N}, 0 \le i \le N)$.
Then, we estimate the matrix $a^\top a$ by using (3), where the quadratic variations are replaced
by their classical approximations and the integrals are replaced by the trapezoidal rule. By
a Cholesky decomposition we then get an estimator $\hat a$ of $a$. Then, we use the MLE (15) on
the path $((\hat a^\top)^{-1} X_{iT/N} \hat a^{-1}, 0 \le i \le N)$. This gives an estimator of $\alpha$ and of $(a^\top)^{-1} b a^\top$, and
therefore an estimator of $b$. As a comparison, we also calculate similarly the estimator of
$\alpha$ and $b$ when $a$ is known and does not have to be estimated. To draw histograms or calculate
empirical expectations, we run $M = 10000$ independent paths of $X$.

We consider a sufficiently large value of $T$ and look at the convergence
with respect to $N$. First, we plot the error on the estimator of $a$ with respect to the
number of time steps in log-log scale. We observe that the convergence to zero takes place
with an experimental rate close to 1/2. This is in line with the general results on the estimation
of the diffusion coefficient, see Dohnal [12] and Genon-Catalot and Jacod [15]. Then, we
focus on the influence of the discretization and of the unknown parameter $a$ on the convergence
of the MLE of $b$ and $\alpha$. In Table 1, we give as a function of $N$ the Mean Squared Error
$\mathrm{MSE}(\hat\theta^N | \theta) = \mathbb{E}[|\hat\theta^N - \theta|^2]$ of the estimator $\hat\theta^N$, with $\theta = (b, \alpha)$. It is estimated with the
empirical expectation. First, we observe that the convergence of the estimator of $\alpha$ is roughly
the same whether we know $a$ or not. This is expected since the estimation of $\alpha$ does not
depend on the estimation of $a$. In contrast, the bias on $b$ is much higher when $a$ is estimated
than when $a$ is known. However, it also decreases faster, at an experimental order of 0.7, while
the bias when $a$ is known decreases at an experimental order of 0.45. This latter rate is in
line with the rate of 1/2 obtained in dimension 1 by Ben Alaya and Kebaier [5]. In our case,
it seems that the influence of the estimation of $a$ vanishes around $N = 5000$. Last, we have
plotted in Figure 4 the limit law of the estimator $\sqrt{T}(\hat\theta^N - \theta)$ with $N = 10000$.

This short numerical study shows that the estimator obtained by discretizing the
continuous time estimator is efficient in practice. Of course, it would be nice to obtain general
convergence results as functions of $T$ and $N$, but we leave this for further research.



Figure 3: Log-Log representation of the empirical expectation of E[Tr[(a — for 

X =(o5°i^), T = 100, a =(q 2 ), a = 4.5, b ={~2 - 2)1 where the line is the simple linear 
regression i.e. log(E[Tr[(a — a^)^]]^/^) Ri 2.62 — 0.581og(A^). 
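The experimental rate read off from the log-log regression of Figure 3 can be obtained with an ordinary least-squares fit. The sketch below is a generic illustration and not the authors' code. `empirical_rate` is a hypothetical helper, and the inputs are assumed to be positive error estimates, for example root mean squared errors computed from the $M$ simulated paths.

```python
import numpy as np

def empirical_rate(n_steps, errors):
    """Fit log(error) = intercept - rate * log(N) and return (rate, intercept)."""
    log_n = np.log(np.asarray(n_steps, dtype=float))
    log_err = np.log(np.asarray(errors, dtype=float))
    slope, intercept = np.polyfit(log_n, log_err, 1)   # degree-1 least-squares fit
    return -slope, intercept
```

With the regression line reported in the caption of Figure 3, the fitted rate would be 0.58 and the intercept 2.62.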
Table 1: Mean Squared Error for the estimation of $\theta = (b, \alpha)$ with respect to $N$. Same
parameters as Figure 3.





(a) Limit law of \/T{h — (b) Limit law of \/T{b — b^) 2 , 2 - 



(c) Limit law of VT{b — (d) Limit law of VT{a — d^). 

Figure 4: Asymptotic laws of the error for the estimation of 0 = (6, a) for a = , N = 10000, 

same parameters as Figure 3. 
A Proof of Proposition 1.1

We denote by $b^s = (b + b^\top)/2$ (resp. $b^a = (b - b^\top)/2$) the symmetric (resp. antisymmetric) part
of $b$. We have

£ TVKv^)-VM41 = 1 log (!^) - TrMT - 1 £ (a - 1 - d) T.[X,-|*. 

f = t y Ti{b\s/jr.dW. + dwj^JY,)\ 


Thus, the only part to calculate is E 


ex' 


p(f£ Tr[-6“V^dW,]) 


JE 


X 


and -we set M“ = f* Tv\b‘^y/JCgdWs] ■ 


We now observe that (^x[AsdWs\jTT\BsdWs\) = Tr[As-B7]ds and are looking for the process 
r that takes values in Sd and minimizes 

{dMt-Tr[Tt{dXt-{ah+bXt+Xtb~^)dt)]) = {- Tr[6“W6“] + 2Tr[rt(W6“ - 6“W)] + 4Tr[r2w]} dt. 
We obtain that 2{Xth^ — b°‘Xt) + A{XtTt + TtXt) = 0 and thus 

It satisfies Ti[Tt{Xtb^ - b^Xt)] = -2Ti[TtiTtXt + XtTt)] = -4Tr[r2Xi]. By construction, 
we have {dMjl-TT[Tt{dXt - iaId + bXt +Xtb'^)dt)],Tr[r{dXt - {aId + bXt +Xtb'^)dt)]) = 0 
for any T £ Sd- Thus, there exists a Brownian motion (3 independent of X such that 
dM^ - TriPtidXt - {aid + bXt + Xtb'^)dt)] = £- Tr[6“W6«] - TV[ri(6«W - Xtb-)]d/3t. In 
fact, both processes (W,/q £— Tr[6“Xs6“] — Trjrt(6“Xs — Ws6“)]d/3s) and 


{Xt, Mf - f Tt[Ts{^/x~sdWs + dWj v^)]) 
Jo 


/o 

solve the same martingale problem for which uniqueness holds. Therefore, we have 

\ 


E 


exp 


'0 


Tr[6“v^dlW] 


J 


= exp 


= exp 


Tr 

cT 


Tt{{ald + hXt + Xtb~^)dt - dXt) 


1 


Tr 


+x 


>0 


Tr 




Tr[6“W6“] + Tr[ri(6“W - XtW)]dt 


Tr[6“Xi6“]dt 


C-^] ( ^{h^Xt - Xth^) ) (6“W - A6“) 


df + ^ Tr[6“Xi6"]dt^ , 


since Tr [£3^^ Q(b'=‘Xt - Wt6“))] = J Tr[Xj“^(6“W - W&“)] = 0 by Lemma B.l and 

Tr[ri(6^W + Xtb^] = TT[b%rtXt + WEt)] = ^ Tr [6*(6“W - Xtb^)] = Tt[b^Xtb% 
Using (6) and the previous calculations, we obtain


L^^° = exp 


- CWr) " 

- _ 1 _ jl^TrlX.-l]* - ^Tr[6] 

+ t / Ti' gxi (6“V, - X,6“) dX,] - J / Tr gi) (6“X, - X,6“) (6“X, - X,6“)] *). 

Last, we use {VXt + Xtb^) = 6" and Tr[£-^ {b^Xt - Xtb^) {b^Xt + Xt6*)] = 2 Tr[6“Xt6*] 
to obtain (7). 


B Technical lemmas 


Lemma B.l. For X G and a ^ 0, let Cx,a md Lx = ^x,o linear appli- 

eations defined by (12) on Sd- If aTr[X“^] 1, then Lx,a is invertible and we have 

Tr[£^^^(y)] = 2(i-^i'^[jr-i]) ' Besides, the map {X,Y,a) i—)■ Lf^^^{Y) is eontinuous on 
{{X,Y,a) G 5+’* X X M+, aTr[X-i] / 1}. 

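Proof. The invertibility of $\mathcal{L}_{X,a}$ is equivalent to its one-to-one property. Since $X \in \mathcal{S}_d^{+,*}$,
there exists an orthogonal matrix $O_X$ and a diagonal matrix $D_X$ with positive elements such
that $X = O_X D_X O_X^\top$. We get
$$Y \in \ker(\mathcal{L}_{X,a}) \iff O_X D_X O_X^\top Y + Y O_X D_X O_X^\top = 2a\,\mathrm{Tr}[Y]\, I_d$$
$$\iff D_X (O_X^\top Y O_X) = 2a\,\mathrm{Tr}[Y]\, I_d - (O_X^\top Y O_X) D_X. \qquad (70)$$

Since $D_X$ is diagonal, we obtain for $1 \le i,k \le d$, $\big((O_X^\top Y O_X) D_X\big)_{i,k} = (O_X^\top Y O_X)_{i,k}\,(D_X)_{k,k}$
and $\big(D_X (O_X^\top Y O_X)\big)_{i,k} = (D_X)_{i,i}\,(O_X^\top Y O_X)_{i,k}$. For $k \neq i$, (70) gives $(O_X^\top Y O_X)_{i,k} = 0$. For
$k = i$, we get $(O_X^\top Y O_X)_{i,i}\,(D_X)_{i,i} = a\,\mathrm{Tr}[Y]$ and therefore
$$\mathrm{Tr}[Y] = \mathrm{Tr}[O_X^\top Y O_X] = \mathrm{Tr}[Y]\, a \sum_{i=1}^d \frac{1}{(D_X)_{i,i}} = \mathrm{Tr}[Y]\, a\,\mathrm{Tr}[X^{-1}].$$

Since $a\,\mathrm{Tr}[X^{-1}] \neq 1$, we obtain $\mathrm{Tr}[Y] = 0$ and then $(O_X^\top Y O_X)_{i,i} = 0$, which gives $Y = 0$ and
the invertibility of $\mathcal{L}_{X,a}$. Let $c = \mathcal{L}_{X,a}^{-1}(Y)$. We have $c + X^{-1} c X - 2a\,\mathrm{Tr}[c]\, X^{-1} = X^{-1} Y$,
which gives $2(1 - a\,\mathrm{Tr}[X^{-1}])\,\mathrm{Tr}[c] = \mathrm{Tr}[X^{-1} Y]$. Last, the continuity property is obvious since
$(X,a) \mapsto \mathcal{L}_{X,a}$ is continuous and $\mathcal{L} \mapsto \mathcal{L}^{-1}$ is continuous on $\{\mathcal{L} : \mathcal{S}_d \to \mathcal{S}_d \text{ linear and invertible}\}$.

□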
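The trace identity of Lemma B.1 can be checked numerically. The sketch below is only an illustration, not part of the original argument: it assumes the form $\mathcal{L}_{X,a}(Y) = XY + YX - 2a\,\mathrm{Tr}[Y] I_d$ read off from the kernel equation (70), and the helper names `L_matrix` and `L_inverse` are hypothetical. It represents the operator as a $d^2 \times d^2$ matrix through Kronecker products and solves the resulting linear system.

```python
import numpy as np

def L_matrix(X, a):
    """Matrix of Y -> X Y + Y X - 2 a Tr[Y] I_d acting on the row-major vec(Y).

    Assumes X is symmetric, so that vec(Y X) = kron(I, X) vec(Y).
    """
    d = X.shape[0]
    I = np.eye(d)
    vec_I = I.reshape(-1)
    return np.kron(X, I) + np.kron(I, X) - 2.0 * a * np.outer(vec_I, vec_I)

def L_inverse(X, a, Y):
    """Solve L_{X,a}(c) = Y for c (hypothetical helper name)."""
    d = X.shape[0]
    return np.linalg.solve(L_matrix(X, a), Y.reshape(-1)).reshape(d, d)

rng = np.random.default_rng(0)
d = 4
A = rng.standard_normal((d, d))
X = A @ A.T + d * np.eye(d)            # positive definite matrix
B = rng.standard_normal((d, d))
Y = B + B.T                            # symmetric right-hand side
tr_X_inv = np.trace(np.linalg.inv(X))
a = 0.5 / tr_X_inv                     # so that a Tr[X^{-1}] = 0.5, different from 1

c = L_inverse(X, a, Y)
residual = X @ c + c @ X - 2.0 * a * np.trace(c) * np.eye(d) - Y
lhs = np.trace(c)
rhs = np.trace(np.linalg.solve(X, Y)) / (2.0 * (1.0 - a * tr_X_inv))

# Lower bound of Lemma B.2 for a = 0: Tr[L_X(Y) Y] >= 2 lambda_min(X) Tr[Y^2]
quad_form = np.trace((X @ Y + Y @ X) @ Y)
lower_bound = 2.0 * np.linalg.eigvalsh(X)[0] * np.trace(Y @ Y)
```

Here `lhs` and `rhs` are the two sides of the trace formula, and `residual` measures how well `c` solves $\mathcal{L}_{X,a}(c) = Y$. The system is solved on all of $\mathcal{M}_d$ for simplicity; for a symmetric right-hand side the solution is symmetric, so it coincides with the inverse on $\mathcal{S}_d$.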


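Lemma B.2. For $X \in \mathcal{S}_d^{+,*}$, $\mathcal{L}_X$ is self-adjoint and positive definite:
$$\mathrm{Tr}[\mathcal{L}_X(Y)\, Y] \ge 2\,\lambda(X)\,\mathrm{Tr}[Y^2],$$
where $\lambda(X) > 0$ is the lowest eigenvalue of $X$. Besides, for $a < 1/\mathrm{Tr}[X^{-1}]$, $\mathcal{L}_{X,a}$ is self-adjoint
and positive definite.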
Proof. For $Y, Z \in \mathcal{S}_d$, we have $\mathrm{Tr}[\mathcal{L}_X(Y) Z] = \mathrm{Tr}[(XY + YX) Z] = \mathrm{Tr}[Y (XZ + ZX)] = \mathrm{Tr}[Y \mathcal{L}_X(Z)]$
and $\mathrm{Tr}[\mathcal{L}_X(Y) Y] = 2\,\mathrm{Tr}[X Y^2] \ge 2\,\lambda(X)\,\mathrm{Tr}[Y^2]$ since $X - \lambda(X) I_d \in \mathcal{S}_d^+$. The self-adjoint
property is then clear for $\mathcal{L}_{X,a}$, and the positive definiteness comes from Lemma B.1 and the
continuity of the eigenvalues of $\mathcal{L}_{X,a}$ with respect to $a$. □

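Lemma B.3. For $X \in \mathcal{S}_d^{+,*}$, the linear application $\tilde{\mathcal{L}}_X(Y) = \mathcal{L}_X^{-1}(YX + XY^\top)\,X$, $Y \in \mathcal{M}_d$, is self-adjoint and
positive. The linear application $\tilde{\mathcal{L}}_{X,a}(Y) = \mathcal{L}_X^{-1}(YX + XY^\top)\,X - a\,\mathrm{Tr}[Y]\, I_d$ is also positive
for $a < 1/\mathrm{Tr}[X^{-1}]$, and there is a positive constant $c_{X,a} > 0$ such that
$$\mathrm{Tr}[\tilde{\mathcal{L}}_{X,a}(Y)^\top Y] \ge c_{X,a}\,\mathrm{Tr}\big[(\mathcal{L}_X^{-1}(YX + XY^\top))^2\big].$$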

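Proof. Since $\mathcal{L}_X^{-1}$ is self-adjoint, we have for $Z \in \mathcal{M}_d$
$$\mathrm{Tr}[\tilde{\mathcal{L}}_X(Y)^\top Z] = \mathrm{Tr}[\mathcal{L}_X^{-1}(YX + XY^\top)\, ZX] = \tfrac12\,\mathrm{Tr}[\mathcal{L}_X^{-1}(YX + XY^\top)(ZX + XZ^\top)]$$
$$= \tfrac12\,\mathrm{Tr}[(YX + XY^\top)\,\mathcal{L}_X^{-1}(ZX + XZ^\top)] = \mathrm{Tr}[Y^\top \tilde{\mathcal{L}}_X(Z)].$$

Similarly, $\mathrm{Tr}[\tilde{\mathcal{L}}_{X,a}(Y)^\top Z] = \mathrm{Tr}[\tilde{\mathcal{L}}_X(Y)^\top Z] - a\,\mathrm{Tr}[Y]\,\mathrm{Tr}[Z] = \mathrm{Tr}[Y^\top \tilde{\mathcal{L}}_{X,a}(Z)]$. Besides, we
notice that
$$\mathrm{Tr}[\tilde{\mathcal{L}}_{X,a}(Y)^\top Y] = \tfrac12\Big(\mathrm{Tr}[(YX + XY^\top)\,\mathcal{L}_X^{-1}(YX + XY^\top)] - 2a\,\mathrm{Tr}[Y]^2\Big)$$
$$= \tfrac12\,\mathrm{Tr}\big[\mathcal{L}_{X,a}\big(\mathcal{L}_X^{-1}(YX + XY^\top)\big)\,\mathcal{L}_X^{-1}(YX + XY^\top)\big]$$
by Lemma B.1, which gives $\mathrm{Tr}[\mathcal{L}_X^{-1}(YX + XY^\top)] = \mathrm{Tr}[Y]$. This gives the claim since $\mathcal{L}_{X,a}$ is positive definite by Lemma B.2. □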

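The following lemma gives the Laplace transform of the matrix Normal distribution.

Lemma B.4. Let $C \in \mathcal{S}_d^{+,*}$ and let $\mathcal{C}[C]$ be the four-index array defined by
$$\mathcal{C}[C]_{i,j,k,l} = \delta_{ik} C_{j,l} + \delta_{il} C_{j,k} + \delta_{jk} C_{i,l} + \delta_{jl} C_{i,k}. \qquad (71)$$

We introduce the $\mathcal{M}_d$-valued random variables $G$ and $\tilde{G} \sim \mathcal{N}(0, \mathcal{C}[C])$ of which components
are Normal random variables with mean 0 such that
$$\forall i,j,k,l \in \{1,\dots,d\}, \quad \mathbb{E}[G_{i,j} G_{k,l}] = \delta_{ik}\delta_{jl}, \quad \mathbb{E}[\tilde{G}_{i,j} \tilde{G}_{k,l}] = \mathcal{C}[C]_{i,j,k,l}. \qquad (72)$$
We have the following results.

1. For all $c \in \mathcal{S}_d$, $\mathbb{E}[\exp(-\mathrm{Tr}[c\,\tilde{G}])] = \exp(2\,\mathrm{Tr}[c^2 C])$.

2. For $\tilde{C} \in \mathcal{M}_d$ such that $\tilde{C}\tilde{C}^\top = C$, $\tilde{C} G + G^\top \tilde{C}^\top$ and $\tilde{G}$ have the same law.

3. Let $X \in \mathcal{S}_d^{+,*}$. For $c \in \mathcal{S}_d$, $\mathbb{E}[\exp(-\mathrm{Tr}[c\,\mathcal{L}_X^{-1}(\sqrt{X} G + G^\top \sqrt{X})])] = \exp(\mathrm{Tr}[c\,\mathcal{L}_X^{-1}(c)])$.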
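Proof. We focus on the first point. For all $c \in \mathcal{S}_d$, we have
$$\mathbb{E}[\exp(-\mathrm{Tr}[c\,\tilde{G}])] = \mathbb{E}\Big[\exp\Big(-\sum_{1 \le i,j \le d} c_{i,j} \tilde{G}_{i,j}\Big)\Big].$$

Moreover, $\sum_{1 \le i,j \le d} c_{i,j} \tilde{G}_{i,j}$ is a Normal random variable and its variance is given by
$$\mathbb{E}\Big[\Big(\sum_{1 \le i,j \le d} c_{i,j} \tilde{G}_{i,j}\Big)^2\Big] = \sum_{1 \le i,j,k,l \le d} c_{i,j}\, c_{k,l}\, \mathcal{C}[C]_{i,j,k,l} = 4\,\mathrm{Tr}[c^2 C].$$

It follows from the moment generating function of the Normal distribution that
$$\mathbb{E}[\exp(-\mathrm{Tr}[c\,\tilde{G}])] = \exp(2\,\mathrm{Tr}[c^2 C]).$$

To prove the second point, it is sufficient to notice that $\mathrm{Tr}[c(\tilde{C} G + G^\top \tilde{C}^\top)] = \mathrm{Tr}[2 c \tilde{C} G]$
and
$$\mathbb{E}\Big[\Big(\sum_{1 \le i,j \le d} (c\tilde{C})_{i,j}\, G_{j,i}\Big)^2\Big] = \sum_{1 \le i,j,k,l \le d} (c\tilde{C})_{i,j} (c\tilde{C})_{k,l}\, \delta_{ik}\delta_{jl} = \sum_{1 \le i,j \le d} (c\tilde{C})_{i,j}^2 = \mathrm{Tr}[c\,\tilde{C}\tilde{C}^\top c].$$

For the third point, we set $Z = \mathcal{L}_X^{-1}(\sqrt{X} G + G^\top \sqrt{X})$ and have $XZ + ZX = \sqrt{X} G + G^\top \sqrt{X}$.
We also introduce $\tilde{c} = \mathcal{L}_X^{-1}(c)$ and have $\tilde{c} X + X \tilde{c} = c$. Thus, we obtain
$$\mathrm{Tr}[c Z] = \mathrm{Tr}[(\tilde{c} X + X \tilde{c}) Z] = \mathrm{Tr}[\tilde{c}\,(\sqrt{X} G + G^\top \sqrt{X})]$$
and therefore $\mathbb{E}[\exp(-\mathrm{Tr}[c Z])] = \exp(2\,\mathrm{Tr}[\tilde{c}^2 X]) = \exp(\mathrm{Tr}[\tilde{c}(\tilde{c} X + X \tilde{c})]) = \exp(\mathrm{Tr}[\tilde{c}\, c])$.

□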
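The second point of Lemma B.4 is a statement about covariances, so it can be verified exactly, without sampling. The following sketch assumes that $G$ has independent standard Normal entries, as in (72); the function names `cov_tensor` and `cov_of_linear_image` are hypothetical. It builds the array (71), computes the covariance of $\tilde{C} G + G^\top \tilde{C}^\top$ from the linear map $G \mapsto \tilde{C} G + G^\top \tilde{C}^\top$, and also evaluates the variance used in the first point.

```python
import numpy as np

def cov_tensor(C):
    """Four-index array (71): d_ik C_jl + d_il C_jk + d_jk C_il + d_jl C_ik."""
    I = np.eye(C.shape[0])
    return (np.einsum('ik,jl->ijkl', I, C) + np.einsum('il,jk->ijkl', I, C)
            + np.einsum('jk,il->ijkl', I, C) + np.einsum('jl,ik->ijkl', I, C))

def cov_of_linear_image(C_tilde):
    """Covariance E[H_ij H_kl] of H = C_tilde G + G^T C_tilde^T for i.i.d. N(0,1) entries of G."""
    d = C_tilde.shape[0]
    A = np.zeros((d, d, d, d))
    for m in range(d):
        for n in range(d):
            E = np.zeros((d, d))
            E[m, n] = 1.0                      # basis matrix E_{mn}
            A[:, :, m, n] = C_tilde @ E + E.T @ C_tilde.T
    # H = sum_{m,n} G_mn A[:, :, m, n], hence Cov = sum_{m,n} A_ijmn A_klmn
    return np.einsum('ijmn,klmn->ijkl', A, A)

rng = np.random.default_rng(1)
d = 3
C_tilde = rng.standard_normal((d, d))
C = C_tilde @ C_tilde.T

target = cov_tensor(C)
computed = cov_of_linear_image(C_tilde)

# Variance appearing in the first point, for a symmetric test matrix c
M = rng.standard_normal((d, d))
c = M + M.T
variance = np.einsum('ij,ijkl,kl->', c, target, c)
expected_variance = 4.0 * np.trace(c @ c @ C)
```

Agreement of `computed` with `target` is the covariance identity behind point 2, and `variance` equal to `expected_variance` is the identity $\sum c_{i,j} c_{k,l}\, \mathcal{C}[C]_{i,j,k,l} = 4\,\mathrm{Tr}[c^2 C]$.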

C Some asymptotic behaviour of Wishart processes 

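Lemma C.1. Let $X \sim WIS_d(x, \alpha, b, I_d)$ with $b \in \mathcal{S}_d$, $x \in \mathcal{S}_d^+$ and $\alpha \ge d-1$. Then $X_T$
converges in law when $T \to +\infty$ if and only if $-b \in \mathcal{S}_d^{+,*}$. In this case, $X_T$ converges in law
to $WIS_d(0, \alpha, 0, \sqrt{-b^{-1}}; 1/2)$.

Let $X \sim WIS_d(x, \alpha, b, I_d)$ with $b \in \mathcal{M}_d$, $x \in \mathcal{S}_d^+$ and $\alpha \ge d-1$. If $-(b + b^\top) \in \mathcal{S}_d^{+,*}$,
$q_\infty := \int_0^\infty e^{sb} e^{sb^\top} ds$ is well defined and $X_T$ converges in law to $WIS_d(0, \alpha, 0, \sqrt{2 q_\infty}; 1/2)$.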

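Proof. Let us first consider the case $-b \in \mathcal{S}_d^{+,*}$. From Proposition 4 in [2], we have for
$v \in \mathcal{S}_d^+$,
$$\mathbb{E}[\exp(-\mathrm{Tr}[v X_T])] = \frac{\exp\Big(\mathrm{Tr}\Big[-v\Big(I_d + 2\int_0^T e^{2bs} ds\; v\Big)^{-1} e^{bT} x\, e^{bT}\Big]\Big)}{\det\Big[I_d + 2\Big(\int_0^T e^{2bs} ds\Big) v\Big]^{\alpha/2}} \ \underset{T \to +\infty}{\longrightarrow}\ \frac{1}{\det[I_d - b^{-1} v]^{\alpha/2}},$$
which is the Laplace transform of $WIS_d(0, \alpha, 0, \sqrt{-b^{-1}}; 1/2)$. Now, let us consider $-b \notin \mathcal{S}_d^{+,*}$.
Then, there exists an eigenvector $v \in \mathbb{R}^d \setminus \{0\}$ such that $bv = \lambda v$ with $\lambda \ge 0$. Then, we have
$\frac{d}{dT}\mathbb{E}[v^\top X_T v] = \alpha\, v^\top v + 2\lambda\, \mathbb{E}[v^\top X_T v]$, and therefore $\mathbb{E}[v^\top X_T v] \to +\infty$ as $T \to +\infty$.

In the case $b \in \mathcal{M}_d$ with $-(b + b^\top) \in \mathcal{S}_d^{+,*}$, we know that the norm of $e^{sb}$ decays
exponentially to 0 as $s \to +\infty$, see e.g. Problem 11.3.6 in Golub and Van Loan [18]. Using
again Proposition 4 in [2], we get that $\mathbb{E}[\exp(-\mathrm{Tr}[v X_T])] \underset{T \to +\infty}{\longrightarrow} \dfrac{1}{\det[I_d + 2 q_\infty v]^{\alpha/2}}$. □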
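The limit in the symmetric case of Lemma C.1 can be illustrated numerically. The sketch below is an illustration under stated assumptions: it uses the Laplace transform formula quoted in the proof for symmetric $b$, evaluates $\int_0^T e^{2bs} ds$ in closed form through the eigendecomposition of $b$, and compares the value at a large horizon with $\det[I_d - b^{-1} v]^{-\alpha/2}$. The function names and the test values of $x$, $v$, $\alpha$ are hypothetical.

```python
import numpy as np

def q_T(b, T):
    """Closed form of int_0^T exp(2 b s) ds for a symmetric invertible matrix b."""
    w, U = np.linalg.eigh(b)
    return (U * ((np.exp(2.0 * w * T) - 1.0) / (2.0 * w))) @ U.T

def laplace_transform(v, x, alpha, b, T):
    """E[exp(-Tr[v X_T])] for symmetric b, following the formula in the proof of Lemma C.1."""
    d = b.shape[0]
    w, U = np.linalg.eigh(b)
    m = (U * np.exp(w * T)) @ U.T                  # matrix exponential exp(b T)
    M = np.eye(d) + 2.0 * q_T(b, T) @ v
    exponent = np.trace(-v @ np.linalg.solve(M, m @ x @ m))
    return np.exp(exponent) / np.linalg.det(M) ** (alpha / 2.0)

rng = np.random.default_rng(2)
d, alpha = 3, 4.5
A = rng.standard_normal((d, d))
b = -(A @ A.T + np.eye(d))          # symmetric with -b positive definite
Bx = rng.standard_normal((d, d))
x = Bx @ Bx.T                       # starting point in S_d^+
Bv = rng.standard_normal((d, d))
v = Bv @ Bv.T                       # argument of the Laplace transform

T_large = 50.0
value = laplace_transform(v, x, alpha, b, T_large)
limit = np.linalg.det(np.eye(d) - np.linalg.solve(b, v)) ** (-alpha / 2.0)
```

For this choice of $b$ the eigenvalues are at most $-1$, so $e^{bT}$ is negligible at $T = 50$ and `value` agrees with `limit` up to rounding, while `2 * q_T(b, T_large)` is numerically equal to $-b^{-1}$.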

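Lemma C.2. • Assume $\alpha > d+1$ and $b = 0$. Then, $\dfrac{Q_T}{d \log(T)} \underset{T \to +\infty}{\longrightarrow} \dfrac{1}{\alpha - (d+1)}$ a.s. Besides, $\dfrac{Z_T}{\log(T)}$
converges almost surely to $d$, and we have
$$\forall \mu > 0, \quad \sup_{T \ge 2} \mathbb{E}\Big[\exp\Big(\mu \frac{N_T}{\sqrt{\log(T)}}\Big)\Big] < \infty. \qquad (73)$$

• Assume $\alpha = d+1$ and $b = 0$. Then, as $T \to +\infty$, $\Big(\dfrac{2}{d \log(T)}\Big)^2 Q_T$ converges in law to
$\tau_1 = \inf\{t \ge 0, B_t = 1\}$, where $B$ is a Brownian motion. Besides, $\dfrac{Z_T}{\log(T)}$ converges
in probability to $d$, and we have
$$\forall \mu > 0, \quad \sup_{T \ge 2} \mathbb{E}\Big[\exp\Big(\mu \frac{N_T}{\log(T)}\Big)\Big] < \infty. \qquad (74)$$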

We mention that the results on the convergence of $Q_T$ are given in Donati-Martin et
al. [13]. However, their proof is in a working paper by the same authors that we have not
been able to find. For this reason, we present here a self-contained proof.

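Proof. We first consider the case $\alpha > d+1$. We have $dX_t = \alpha I_d\, dt + \sqrt{X_t}\, dW_t + dW_t^\top \sqrt{X_t}$
and thus
$$d(e^{-t} X_{e^t - 1}) = [\alpha I_d - e^{-t} X_{e^t - 1}]\, dt + \sqrt{e^{-t} X_{e^t - 1}}\; d\tilde{W}_t + d\tilde{W}_t^\top \sqrt{e^{-t} X_{e^t - 1}},$$
with $d\tilde{W}_t = e^{-t/2}\, d(W_{e^t - 1})$. We observe that $\tilde{W}$ is a matrix Brownian motion, which gives
$Y \sim WIS_d(x, \alpha, -I_d/2, I_d)$, where $Y_t = e^{-t} X_{e^t - 1}$ for $t \ge 0$. Using equation (25) to the
process $Y$, we get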

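$$\frac{1}{t} \log\Big(\frac{\det[Y_t]}{\det[x]}\Big) - \frac{2}{t} \int_0^t \mathrm{Tr}\big[\sqrt{Y_s^{-1}}\, d\tilde{W}_s\big] = \frac{\alpha - 1 - d}{t} \int_0^t \mathrm{Tr}[Y_s^{-1}]\, ds - d. \qquad (75)$$

Since $Y$ is ergodic and $\big\langle \int_0^\cdot \mathrm{Tr}[\sqrt{Y_s^{-1}}\, d\tilde{W}_s] \big\rangle_t = \int_0^t \mathrm{Tr}[Y_s^{-1}]\, ds$, we get that the left hand side
converges in probability to zero and the right hand side converges a.s. to $(\alpha - 1 - d)\,\mathbb{E}[\mathrm{Tr}[Y_\infty^{-1}]] - d$,
where $Y_\infty \sim WIS_d(0, \alpha, 0, \sqrt{2} I_d; 1/2)$ is the stationary law of $Y$. Therefore, $\frac{1}{t} \log\Big(\frac{\det[Y_t]}{\det[x]}\Big)$
converges a.s. to zero. Since $\frac{Z_{e^t - 1}}{t} = \frac{1}{t} \log\Big(\frac{\det[Y_t]}{\det[x]}\Big) + d$,
we get that $\frac{Z_T}{\log(T)}$ converges a.s. to $d$ when $T \to +\infty$.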
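The argument above forces $(\alpha - 1 - d)\,\mathbb{E}[\mathrm{Tr}[Y_\infty^{-1}]] = d$. This can be illustrated by Monte Carlo under two assumptions that are not spelled out in the text: $\alpha$ is taken to be an integer, and the stationary law, whose Laplace transform is $\det[I_d + 2v]^{-\alpha/2}$ by Lemma C.1, is sampled as a standard Wishart matrix $G^\top G$ with $G$ an $\alpha \times d$ matrix of independent standard Normal entries. The function name `mean_trace_inverse` and the chosen parameter values are hypothetical.

```python
import numpy as np

def mean_trace_inverse(alpha, d, n_samples, seed=0):
    """Monte Carlo estimate of E[Tr[W^{-1}]] for W = G^T G, G an alpha x d standard Normal matrix."""
    rng = np.random.default_rng(seed)
    G = rng.standard_normal((n_samples, alpha, d))
    W = np.einsum('nki,nkj->nij', G, G)        # batch of Wishart_d(alpha, I_d) samples
    W_inv = np.linalg.inv(W)                   # batched inverse
    return np.trace(W_inv, axis1=1, axis2=2).mean()

alpha, d = 8, 2                                # integer alpha > d + 3, so the estimator has finite variance
estimate = mean_trace_inverse(alpha, d, 200_000)
exact = d / (alpha - d - 1)                    # standard inverse-Wishart mean: E[W^{-1}] = I_d / (alpha - d - 1)
drift_limit = (alpha - 1 - d) * estimate - d   # should be close to zero
```

With these values `estimate` should be close to `exact`, which is consistent with the almost sure limit of $Q_T / (d \log(T))$ stated in Lemma C.2.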

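Now, we use (25) taken at time $T = e^t - 1$ and Dubins-Schwarz theorem: there is a
Brownian motion $\tilde{\beta}$ such that for all $t \ge 0$,
$$\frac{\alpha - (1+d)}{t}\, Q_{e^t - 1} = \frac{1}{t} \log\Big(\frac{\det[X_{e^t - 1}]}{\det[x]}\Big) - \frac{2}{t}\, \tilde{\beta}_{Q_{e^t - 1}} = \frac{1}{t} \log\Big(\frac{\det[Y_t]}{\det[x]}\Big) + d - \frac{2}{t}\, \tilde{\beta}_{Q_{e^t - 1}}.$$

This gives that $\frac{\alpha - (1+d)}{t}\, Q_{e^t - 1} \underset{t \to +\infty}{\longrightarrow} d$ a.s., and therefore
$$\frac{Q_T}{d \log(T)} \underset{T \to +\infty}{\longrightarrow} \frac{1}{\alpha - (1+d)} \quad \text{a.s.}$$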
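It remains to prove (73). From (25), we have $N_T = \frac{Z_T}{2} - \frac{\alpha - (1+d)}{2}\, Q_T \le \frac{Z_T}{2}$ and thus
$$\mathbb{E}\Big[\exp\Big(\mu \frac{N_T}{\sqrt{\log(T)}}\Big)\Big] \le \mathbb{E}\Big[\Big(\frac{\det[X_T]}{\det[x]}\Big)^{\frac{\mu}{2\sqrt{\log(T)}}}\Big] < \infty,$$
since the moments of $X$ are bounded.

Again we set $t = \log(T+1)$, and for $\lambda \in [0,1]$ we have from (75)
$$N_T = \int_0^T \mathrm{Tr}\big[\sqrt{X_s^{-1}}\, dW_s\big] = \int_0^t \mathrm{Tr}\big[\sqrt{Y_s^{-1}}\, d\tilde{W}_s\big]$$
$$= \lambda \int_0^t \mathrm{Tr}\big[\sqrt{Y_s^{-1}}\, d\tilde{W}_s\big] + (1 - \lambda)\Big(\frac{1}{2} \log\Big(\frac{\det[Y_t]}{\det[x]}\Big) + \frac{dt}{2} - \frac{\alpha - 1 - d}{2} \int_0^t \mathrm{Tr}[Y_s^{-1}]\, ds\Big).$$

By Cauchy-Schwarz inequality, we get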


E 


exp 


\/iog(r +1) 


Nx 




/det[lt]y^ 


X E 2 


exp 


\ det[x] ) 


-IjTJ/l A \^ ^ f m„rAV-ll 


Tr[y-Vs 


We now take A = At = ^ (~1 + \/l + 4et) with et = i^d)Vt order to obtain 
2 ~ ~ We note that for t large enough. At G [0,1]. Besides, we 


V Vt ) 

have At ^ = 1 — et + o(l/t), so that \/t(l — At) converges to From Theorem 4.1 

in [29], the second expectation is then equal to 1, while the first one is bounded since $Y$ is
ergodic. This yields (73).

We now consider the case $\alpha = d+1$. We set again $t = \log(1+T)$ and have $T = e^t - 1$.
Thus,
$$Z_T = \log\Big(\frac{\det[X_{e^t - 1}]}{\det[x]}\Big) = \log\Big(\frac{\det[Y_t]}{\det[x]}\Big) + dt.$$

Again, $Y_t$ converges in law towards $WIS_d(0, \alpha, 0, \sqrt{2} I_d; 1/2)$. Therefore, the ergodic theorem
gives that $\frac{1}{t} \log\Big(\frac{\det[Y_t]}{\det[x]}\Big)$ converges in probability to 0, which yields the convergence in
probability of $\frac{Z_T}{\log(T)}$ to $d$. We now turn to the convergence of $\Big(\frac{2}{d \log(T)}\Big)^2 Q_T$. We know from
Theorem 4.1 in Mayerhofer [29] that for $T > 0$ and $\lambda \ge 0$,


$$\mathbb{E}\Big[\exp\Big(\frac{2\lambda}{d \log(1+T)}\, N_T - \frac{(2\lambda)^2}{2 d^2 \log(1+T)^2}\, Q_T\Big)\Big] = 1.$$


From (25), we have $N_T = Z_T/2$ and we write

(2A)2 


1 =E 


exp A — 


-1 


+ E 


exp 


2d2 log(l + r)2^^ 
(2A)2 


2d2 log(l + T) 


We now observe that $\exp\Big(-\frac{(2\lambda)^2}{2 d^2 \log(1+T)^2}\, Q_T\Big) \le 1$ and that




exp 




2A 


Ulogd+ T)^-J 


A-l 


E 


exp 


2A 


(dlog(l+T) 


Nx 


= E 


/ det[X'r]A tUog(i+x) 
\ det[x] J 


= e^E 

/ det[lt]\ 


\ det[x] ) 
Since Y has bounded moments and is stationary, sup^^j^ E
uniform integrability (74) and that 


/ detlYj] \ dt 
^ det[fc] J 


< oo. This gives the 


E 



_W_q-A 

2d2log(l + r)2^^ J 



2A 


(diog(i + r) 



exp(A) 


T^+oo 


0 . 


Therefore, $\lim_{T \to +\infty} \mathbb{E}\Big[\exp\Big(\lambda - \frac{(2\lambda)^2}{2 d^2 \log(1+T)^2}\, Q_T\Big)\Big] = 1$, which gives the desired convergence in law.

□